FIGURE 1 



CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 
CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 
CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 
AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 
TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGGAGGCACAGGTGGCCCCCACCACCCGGAGG 
AGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGA 
AGGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCT 
TCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTAC CGGC C CGGCCGTTAGGGTGTGTGCT 
GTCCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCC 
TCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATTTATAGGACCGCCTAC 
CGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAG 
GACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAG 
GGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGC C CTGCAGGATGGCGGGGTGAC ACTTGCCAG 
TCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGC 
CGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTG 
TGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAG 
GAAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCT 
GGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCC 
TCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTC 
CTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGG 
CTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTC 
CAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTTTTCCTCCTC 
CCCTTCCCTCGGGAGGGTCCCCAGACCCTGGCATGGGATGGGCTGGGATTTTTTTTGTGAAT 
CCAGCCCTGGCTACCCCCACCCTGGTTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCA 
GCTGAGGGAAGGTACGAGTTCCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCC 
CGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAA 
CGTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCT 
AGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGT 
TACAAAT 



FIGURE 2 



MTDSPPPGHPEEKATPPGGTGHEGLSGGAADVASGVGSGRHRARLPARPLGCVLSRAHGDPV 
SESFVQRVYQPFLTTCDGHRACSTYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLEGAC 
GAAICQPPCRNGGSCVQPGRCRCPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCW 
EGHSLSADGTLCVPKGGPPRVAPNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLAS 
QALEHGLPDPGSLLVHSFQQLGRIDSLS EQ I S FLEEQLGSCSCKKDS 

Signal sequence: 

amino acids 1-19 

cAMP- and cGMP- dependent protein kinase phosphorylation sites. 

amino acids 93-97, 270-274 

N-myristoylation sites. 

amino acids 19-25, 78-84, 97-103, 100-106, 103-109, 157-163, 
191-197, 265-271 

Amidation site. 

amino acids 26-30 

Aspartic acid and asparagine hydroxylation site. 

amino acids 152-164 

Cell attachment sequence. 

amino acids 13 0-133 



EGF-like domain cysteine pattern signature. 

amino acids 123-135 



FIGURE 3 



CGCTCGCCCCGTCGCCCCTCGCCTCCCCGCAGAGTCCCCTCGCGGCAGCAGATGTGTGTGGG 
GTCAGCCCACGGCGGGGACTATGGTGAAATTCCCGGCGCTCACGCACTACTGGCCCCTGATC 
CGGTTCTTGGTGCCCCTGGGCATCACCAACATAGCCATCGACTTCGGGGAGCAGGCCTTGAA 
CCGGGGCATTGCTGCTGTCAAGGAGGATGCAGTCGAGATGCTGGCCAGCTACGGGCTGGCGT 
ACTCCCTCATGAAGTTCTTCACGGGTCCCATGAGTGACTTCAAAAATGTGGGCCTGGTGTTT 
GTGAACAGCAAGAGAGACAGGACCAAAGCCGTCCTGTGTATGGTGGTGGCAGGGGCCATCGC 
TGCCGTCTTTCACACACTGATAGCTTATAGTGATTTAGGATACTACATTATCAATAAACTGC 
ACCATGTGGACGAGTCGGTGGGGAGCAAGACGAGAAGGGCCTTCCTGTACCTCGCCGCCTTT 
CCTTTCATGGACGCAATGGCATGGACCCATGCTGGCATTCTCTTAAAACACAAATACAGTTT 
CCTGGTGGGATGTGCCTCAATCTCAGATGTCATAGCTCAGGTTGTTTTTGTAGCCATTTTGC 
TTCACAGTCACCTGGAATGCCGGGAGCCCCTGCTCATCCCGATCCTCTCCTTGTACATGGGC 
GCACTTGTGCGCTGCACCACCCTGTGCCTGGGCTACTACAAGAACATTCACGACATCATCCC 
TGACAGAAGTGGCCCGGAGCTGGGGGGAGATGCAACAATAAGAAAGATGCTGAGCTTCTGGT 
GGCCTTTGGCTCTAATTCTGGCCACACAGAGAATCAGTCGGCCTATTGTCAACCTCTTTGTT 
TCCCGGGACCTTGGTGGCAGTTCTGCAGCCACAGAGGCAGTGGCGATTTTGACAGCCACATA 
CCCTGTGGGTCACATGCCATACGGCTGGTTGACGGAAATCCGTGCTGTGTATCCTGCTTTCG 
ACAAGAATAACCCCAGCAACAAACTGGTGAGCACGAGCAACACAGTCACGGCAGCCCACATC 
AAGAAGTTCACCTTCGTCTGCATGGCTCTGTCACTCACGCTCTGTTTCGTGATGTTTTGGAC 
ACCCAACGTGTCTGAGAAAATCTTGATAGACATCATCGGAGTGGACTTTGCCTTTGCAGAAC 
TCTGTGTTGTTCCTTTGCGGATCTTCTCCTTCTTCCCAGTTCCAGTCACAGTGAGGGCGCAT 
CTCACCGGGTGGCTGATGACACTGAAGAAAACCTTCGTCCTTGCCCCCAGCTCTGTGCTGCG 
GATCATCGTCCTCATCGCCAGCCTCGTGGTCCTACCCTACCTGGGGGTGCACGGTGCGACCC 
TGGGCGTGGGCTCCCTCCTGGCGGGCTTTGTGGGAGAATCCACCATGGTCGCCATCGCTGCG 
TGCTATGTCTACCGGAAGCAGAAAAAGAAGATGGAGAATGAGTCGGCCACGGAGGGGGAAGA 
CTCTGCCATGACAGACATGCCTCCGACAGAGGAGGTGACAGACATCGTGGAAATGAGAGAGG 
AGAATGAATAAGGCACGGGACGCCATGGGCACTGCAGGGACGGTCAGTCAGGATGACACTTC 
GGCATCATCTCTTCCCTCTCCCATCGTATTTTGTTCCCTTTTTTTTGTTTTGTTTTGGTAAT 
GAAAGAGGCCTTGATTTAAAGGTTTCGTGTCAATTCTCTAGCATACTGGGTATGCTCACACT 
GACGGGGGGACCTAGTGAATGGTCTTTACTGTTGCTATGTAAAAACAAACGAAACAACTGAC 
TTCATACCCCTGCCTCACGAAAACCCAAAAGACACAGCTGCCTCACGGTTGACGTTGTGTCC 
TCCTCCCCTGGACAATCTCCTCTTGGAACCAAAGGACTGCAGCTGTGCCATCGCGCCTCGGT 
CACCCTGCACAGCAGGCCACAGACTCTCCTGTCCCCCTTCATCGCTCTTAAGAATCAACAGG 
TTAAAACTCGGCTTCCTTTGATTTGCTTCCCAGTCACATGGCCGTACAAAGAGATGGAGCCC 
CGGTGGCCTCTTAAATTTCCCTTCTGCCACGGAGTTCGAAACCATCTACTCCACACATGCAG 
GAGGCGGGTGGCACGCTGCAGCCCGGAGTCCCCGTTCACACTGAGGAACGGAGACCTGTGAC 
CACAGCAGGCTGACAGATGGACAGAATCTCCCGTAGAAAGGTTTGGTTTGAAATGCCCCGGG 
GGCAGCAAACTGACATGGTTGAATGATAGCATTTCACTCTGCGTTCTCCTAGATCTGAGCAA 
GCTGTCAGTTCTCACCCCCACCGTGTATATACATGAGCTAACTTTTTTAAATTGTCACAAAA 
GCGCATCTCCAGATTCCAGACCCTGCCGCATGACTTTTCCTGAAGGCTTGCTTTTCCCTCGC 
CTTTCCTGAAGGTCGCATTAGAGCGAGTCACATGGAGCATCCTAACTTTGCATTTTAGTTTT 
TACAGTGAACTGAAGCTTTAAGTCTCATCCAGCATTCTAATGCCAGGTTGCTGTAGGGTAAC 
TTTTGAAGTAGATATATTACCTGGTTCTGCTATCCTTAGTCATAACTCTGCGGTACAGGTAA 
TTGAGAATGTACTACGGTACTTCCCTCCCACACCATACGATAAAGCAAGACATTTTATAACG 
ATACCAGAGTCACTATGTGGTCCTCCCTGAAATAACGCATTCGAAATCCATGCAGTGCAGTA 
TATTTTTCTAAGTTTTGGAAAGCAGGTTTTTTCCTTTAAAAAAATTATAGACACGGTTCACT 
AAATTGATTTAGTCAGAATTCCTAGACTGAAAGAACCTAAACAAAAAAATATTTTAAAGATA 
TAAATATATGCTGTATATGTTATGTAATTTATTTTAGGCTATAATACATTTCCTATTTTCGC 
ATTTTCAATAAAATGTCTCTAATACAAAAAA 



FIGURE 4 



MVKFPALTHYWPLIRFLVPLGITNIAIDFGEQALNRGIAAVKEDAVEMLASYGLAYSLMKFF 
TGPMSDFKNVGLVFVNSKRDRTKAVLCMVVAGAIAAVFHTLIAYSDIjGYYIINKLiHHVDESV 
GS KTRRAFL YLAAFP FMD AMAWTHAG I LLKHKYS FLVGC AS I SDVIAQWFVAILLHSHLEC 
REPLLIPILSLYMGALVRCTTLCLGYYKNIHDIIPDRSGPELGGDATIRKMLSFWWPLALIL 
ATQRISRPIVNLFVSRDLGGSSAATEAVAILTATYPVGHMPYGWLTEIRAVYPAFDKNNPSN 
KLVSTSNTVTAAHIKKFTFVCMALSLTLCFVMFWTPNVSEKILIDIIGVDFAFAELCWPLR 
IFSFFPVPVTVRAHLTGWLMTLKKTFVLAPSSVLRIIVLIASLVVLPYLGVHGATLGVGSLL 
AGFVGESTMVAIAACYWRKQKKKMENESATEGEDSAMTDMPPTEEVTDIVEMREENE 

Transmembrane domains: 

amino acids 86-106, 163-179, 191-205, 237-253, 327-343, 357-374, 
408-423, 431-445 



FIGURE 5 



CCTGACAGAAGTGCCCCGGAGCTGGGGGAGATNCAACATTAAGAAGATGCTGAGCTTCTGGT 
GCCNTTTGGCTCTAATTCTGGCCACACAGAGAANCAGTCGGCCTATTGTCAACCTCTTTGTT 
TCCCGGGACCTTGGTGGCAGTTCTGCAGCCACAGAGGCAGTGGCGATTTTGACAGCCACATA 
CCCTGTGGGTCACATGCCATACGGCTGGTTGACGGAAATCCGTGCTGTGTATCCTGCTTTCG 
ACAAGAATAACCCCAGCAACAAACTGGTGAGCACGAGCAACACAGTCACGGCGGCCCACATC 
AAGAAGTTCACCTTCGTCTGCATGGCTCTGTCACTCACGCTCTGTTTCGTGATGTTTTGGAC 
ACCCAACGTGTCTGNGAAAATCTTGATAGACATCATCGGAGTGGACTTTGCCTTTGCAGAAC 
TCTGTGTTGTTCCTTTGCGGATCTTCTCCTTCTTCCCAGTTCCAGTCACAGTGAGGGCGCAT 
CTCACCGGGTGGCTGATGACACTGAAGAAAACCTTCGTC 



FIGURE 6 



TGACGGAATCCCGGGCTGGGTATCCTGGTTTNGACAAGATAAACCCCCAGCAANAAATTGGG 
GAGCAGGGCAAAACAGTNACGGGCAGCCCACATCAAGAAGTTCACCTTNGTTTGNATGGNTC 
TGTCAACTCACGCTNTGTTTCGTGATGTTTTGGACACCCAAAGTGTTTGAGAAAATTTTGAT 
AGACATNATCGGAGTGGANTTTGCCTTTGCAGAANTTTGNGNTGTTCCTTTGCGGATTTTCT 
CCTTTTTCCCAGTTCCAGTCACAGNGAGGGCGCATCTCACCGGGNGGNTGATGACANTGAAG 
AAAACCTTTGTCCTTGCCCCCAGCTNTTTGGTGCGGATCATTGTCCTNATNGCCAGCCTTGT 
GGTCCTACCCTACCTGGGGGTGCACGGTGCGACCCTGGGCGTGGGTTCCCTCCTGGCGGGCA 



FIGURE 7 

TATTCCCAGTTCCGGTCACGGGGAGGGCGCATNTCACCGGGTGGCTGANGACACTGAAGAAA 
ACCTTNGTCCTTGCCCCCAGNTTTGTGNTGCGGATNATCGTCCTCATCGCCAGCCTNGTGGT 
CCTACCCTACCTGGGGGTGCACGGTGAGAC 



FIGURE 8 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGA 
GCCTGCTCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAG 
CTGCTGCCCCGCCAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCC 
TGGGGGTGCTGGTGTCCATCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTG 
CCCTGGGTGTGTGAGGAGGGGGCCGGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGG 
CTCCCTGCTTGGCTACCGCGCTGTCTACCGCATGTGCTTCGCCACGGCGGCCTTCTTCTTCT 
TCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCAGCCGGGACCCCCGGGCTGCCATCCAG 
AATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTCACCGTGGGTGCCTTCTACAT 
CCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGGCTCCTTCCTCTTCA 
TCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGTGGCTGGGC 
AAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTCTT 
CTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCT 
GCCACGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCT 
GCTGTCCTGCCCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCAT 
CACCCTCTACACCATGTTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCA 
ACCCCCATTTGCCAACCCAGCTGGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAG 
ACCCAGTGGTGGGATGCCCCGAGCATTGTGGGCCTCATCATCTTCCTCCTGTGCACCCTCTT 
CATCAGTCTGCGCTCCTCAGACCACCGGCAGGTGAACAGCCTGATGCAGACCGAGGAGTGCC 
CACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGTGGCAGCCTGTGAGGGCCGGGCC 
TTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCCACTTCTGCCTGGTGCT 
GGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAGACCCGGAAGA 
TGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCTCCTC 
TACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCT 
CACAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCT 
GCCCCCTCCCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTG 
CCCCTGAGCCGGGCCTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCA 
GAGCCCCATCCCCCCGCCACACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGT 
TGCCCATACTCAGCATCTCGGATGAAAGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGG 
CTGCTGGAGAGAGCGGGGAACTCCCACCACAGTGGGGCATCCGGCACTGAAGCCCTGGTGTT 
CCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTGGACTTCGTGCCTTACTGAGTCTCT 
AAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 



FIGURE 9 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVE 
S QL YKLP WVCEEGAGI PTVLQGH I DCGS LLGYRAVYRMCFATAAFFFFFFTLLMLCVS S SRD 
PRAAI QNGFWFFKFL I LVGLTVGAF YI PDGS FTN I WFYFG WGS FLF I L I QLVLL IDFAHS W 
NQRWLGKAEECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFC 
VC VS I AAVLPKVQDAQPNSGLLQASVITLYTMFVTWSALS S I PEQKCNPHLPTQLGNET WA 
GPEGYETQWWDAPSIVGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVA 
ACEGRAFDNEQDGVTYSYSFFHFCLVLASLHVl^TLTNWYKPGETRKMISTWTAVWVKICAS 
WAGLLLYLWTLVAPLLLRNRDFS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 



FIGURE 10 



GAGCGAGGCCGGGGACTGAAGGTGTGGGTGTCGAGCCCTCTGGCAGAGGGTTAA.CCTGGGTC 
AAATGCACGGATTCTCACCTCGTACAGTTACGCTCTCCCGCGGCACGTCCGCGAGGACTTGA 
AGTCCTGAGCGCTCAAGTTTGTCCGTAGGTCGAGAGAAGGCCATGGAGGTGCCGCCACCGGC 
ACCGCGGAGCTTTCTCTGTAGAGCATTGTGCCTATTTCCCCGAGTCTTTGCTGCCGAAGCTG 
TGACTGCCGATTCGGAAGTCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCC 
TATTACCCGGAATCTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAG 
AATTTCAAAGGACCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGG 
TGTATGGGGGAATACCAGCTTTTATTCATGCTAAACAACAATACATTGAGCAGAGCCAGGCA 
GAAATTTATCATAACCGGTTTGATGCTGTGCAATCTGCACATCGTGCTGCCACACGAGGCTT 
CATTCGTTATGGCTGGCGCTGGGGTTGGAGAACTGCAGTGTTTGTGACTATATTCAACACAG 
TGAACACTAGTCTGAATGTATACCGAAATAAAGATGCCTTAAGCCATTTTGTAATTGCAGGA 
GCTGTCACGGGAAGTCTTTTTAGGATAAACGTAGGCCTGCGTGGCCTGGTGGCTGGTGGCAT 
AATTGGAGCCTTGCTGGGCACTCCTGTAGGAGGCCTGCTGATGGCATTTCAGAAGTACGCTG 
GTGAGACTGTTCAGGAAAGAAAACAGAAGGATCGAAAGGCACTCCATGAGCTAAAACTGGAA 
GAGTGGAAAGGCAGACTACAAGTTACTGAGCACCTCCCTGAGAAAATTGAAAGTAGTTTACG 
GGAAGATGAACCTGAGAATGATGCTAAGAAAATTGAAGCACTGCTAAACCTTCCTAGAAACC 
CTTCAGTAATAGATAAACAAGACAAGGACTGAAAGTGCTCTGAACTTGAAACTCACTGGAGA 
GCTGAAGGGAGCTGCCATGTCCGATGAATGCCAACAGACAGGCCACTCTTTGGTCAGCCTGC 
TGACAAATTTAAGTGCTGGTACCTGTGGTGGCAGTGGCTTGCTCTTGTCTTTTTCTTTTCTT 
TTTAACTAAGAATGGGGCTGTTGTACTCTCACTTTACTTATCCTTAAATTTAAATACATACT 
TATGTTTGTATTAATCTATCAATATATGCATACATGGATATATCCACCCACCTAGATTTTAA 
GCAGTAAATAAAACATTTCGCAAAAGATTAAAGTTGAATTTTACAGTTT 



FIGURE 11 

>< /us r / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA2 3318 
xsubunit 1 of 1, 285 aa, 1 stop 
><MW: 32190, pi: 9.03, NX{S/T) : 2 

MEVPPPAPRSFLCRALCLFPRVFAAEAVTADSEVLEERQKRLPYVPEPYYPESGWDRLRELF 
GKDEQQRISKDLANICKTAATAGIIGWVYGGIPAFIHAKQQYIEQSQAEIYHNRFDAVQSAH 
RAATRGFIRYGWRWGWRTAVFVTIFNTWTSLNVYRNKDALSHFVIAGAVTGSLFRINVGLR 
GLVAGGIIGALLGTPVGGLLMAFQKYAGETVQERKQKDRKALHELKLEEWKGRLQVTEHLPE 
KI ES SLREDEPENDAKKI EALLNLPRNPSVIDKQDKD 

Important Features: 
Signal Peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 76-96 and 171-195 

N-glycosylation site: 

amino acids 153-156 



FIGURE 12 



CGGAAGTCCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 
ATCTGGATGGGACCGCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGGA 
CCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGAA 
TACCAGCTTTTATTCATGCTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATCAT 
AACCGGTTTGATGCTGTGCAATCTGCACATCGTGCTGCCACACGAGGCTTCATTCGTTCATG 
GCTGGCGCCGAACC 



FIGURE 13 



TCAAGTTTGTCCGTAGGTCGAGAGAAGGCCATGGAGGTGCCGCCACCGGCACCGCGGAGCTT 
TTTTCTGTAGAGCATTGTGCCTATTTCCCCGAGTTTTTGCTGCCGAAGCTGTGACTGCCGAT 
TCGGAAGTCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 
ATTTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGG 
ACCTTGCTGATATNTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGA 
ATACCAGCTTTTATTCATGNTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATNA 
TAACC 



FIGURE 14 



GAGCCGCCGCCGCGCGCGCGCCGCGCACTGCAGCCCCAGGCCCCGGCCCCCCACCCACGTCT 
GCGTTGCTGCCCCGCCTGGGCCAGGCCCCAAAGGCAAGGACAAAGCAGCTGTCAGGGAACCT 
CCGCCGGAGTCGAATTTACGTGCAGCTGCCGGCAACCACAGGTTCCAAGATGGTTTGCGGGG 
GCTTCGCGTGTTCCAAGAACTGCCTGTGCGCCCTCAACCTGCTTTACACCTTGGTTAGTCTG 
CTGCTAATTGGAATTGCTGCGTGGGGCATTGGCTTCGGGCTGATTTCCAGTCTCCGAGTGGT 
CGGCGTGGTCATTGCAGTGGGCATCTTCTTGTTCCTGATTGCTTTAGTGGGTCTGATTGGAG 
CTGTAAAACATCATCAGGTGTTGCTATTTTTTTATATGATTATTCTGTTACTTGTATTTATT 
GTTCAGTTTTCTGTATCTTGCGCTTGTTTAGCCCTGAACCAGGAGCAACAGGGTCAGCTTCT 
GGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAATCTAAACTGCT 
GTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTCTGGCTAGCTGTGTTAAAAGTGACCAC 
TCGTGCTCGCCATGTGCTCCAATCATAGGAGAATATGCTGGAGAGGTTTTGAGATTTGTTGG 
TGGCATTGGCCTGTTCTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACCTACAGATACA 
GGAACCAGAAAGACCCCCGCGCGAATCCTAGTGCATTCCTTTGATGAGAAAACAAGGAAGAT 
TTCCTTTCGTATTATGATCTTGTTCACTTTCTGTAATTTTCTGTTAAGCTCCATTTGCCAGT 
TTAAGGAAGGAAACACTATCTGGAAAAGTACCTTATTGATAGTGGAATTATATATTTTTACT 
CTATGTTTCTCTACATGTTTTTTTCTTTCCGTTGCTGAAAAATATTTGAAACTTGTGGTCTC 
TGAAGCTCGGTGGCACCTGGAATTTACTGTATTCATTGTCGGGCACTGTCCACTGTGGCCTT 
TCTTAGCATTTTTACCTGCAGAAAAACTTTGTATGGTACCACTGTGTTGGTTATATGGTGAA 
TCTGAACGTACATCTCACTGGTATAATTATATGTAGCACTGTGCTGTGTAGATAGTTCCTAC 
TGGAAAAAGAGTGGAAATTTATTAAAATCAGAAAGTATGAGATCCTGTTATGTTAAGGGAAA 
TCCAAATTCCCAATTTTTTTTGGTCTTTTTAGGAAAGATTGTTGTGGTAAAAAGTGTTAGTA 
TAAAAATGATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAAATAGTTAT 
GTCTTAGGAAATTGTGGTTTAATTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTGGTTT 
CATGAAATGTTCTAATGTATAATAACATTTACCTTCAGCCTCCATCAGAATGGAACGAGTTT 
TGAGTAATCAGGAAGTATATCTATATGATCTTGATATTGTTTTATAATAATTTGAAGTCTAA 
AAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGATATTTG 
ATTATCTTAAAAATTGTTAAATACCGTTTTCATGAAATTTCTCAGTATTGTAACAGCAACTT 
GTCAAACCTAAGCATATTTGAATATGATCTCCCATAATTTGAAATTGAAATCGTATTGTGTG 
GCTCTGTATATTCTGTTAAAAAATTAAAGGACAGAAACCTTTCTTTGTGTATGCATGTTTGA 
ATTAAAAGAAAGTAATGGAAG 



FIGURE 15 

>< /us r / s eqdb2 / s s t /DNA/Dnaseqs .min/ss. DNA3 9979 
xsubunit 1 of 1, 204 aa, 1 stop 
><MW: 22147, pi: 8.37, NX(S/T): 3 

MVCGGFACS KNCLCALNLLYTLVSLLL I GI AAWGI GFGLI S S LRWGWIAVGI FLFLIALV 
GL I GAVKHHQVLL F FYMI I LLLVF I VQFSVSCACLALKQEQQGQLLEVGWNNTAS ARND I QR 
NLNCCGFRSVNPNDTCLASCVKSDHSCSPCAPIIGEYAGEVLRFVGGIGLFFSFTEILGVWL 
TYRYRNQKDPRANPSAFL 

Signal Peptide: 

amino acids 1-34 

Transmembrane domains: 

amino acids 47-63, 72-95 and 162-182 



FIGURE 16 



TGATTGGAGCTGTAAAAAANTCTTCAGGTGTTGTNATTTTTTTATATGATTATTCTGTAANT 
TGTATTTATTGTTCAGTTTTNTGTATCTTGCGCTTGTTTAGCCNTGAACCAGGAGCAACAGG 
GTCAGNTTNTGGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAAT 
NTAAACTGCTGTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTNTGGCTAGCTGTGTTAA 
AAGTGACCACTNGTGCTCGCCATGTGCTCCAATCATAGGAGAATATGCTGGAGAGGTTTTGA 
GATTTGTTGGTGGCATTGGCCTGTTNTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACC 
TACAGATACAGGAACCAG 



FIGURE 17 



AATCCCAAATTCCCCAATTTTTTTGGNCTTTTTAGGGAAAGATGTGTTGTGGTAAAAAGTGT 
TAGTATAAAAATGATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAATAG 
TTATGTCTTAGGAAATTGTGGTTTAATTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTG 
GTTTCATGAAATGTTCTAATGTATAATAACATTTACCTTCAGCCTCCCATCAGAATGGAACG 
AGTTTTGAGTAATCCAGGAAGTATATCTATATGATCTTGATATTGTTTTATATAATTTGAAG 
TCTAAAAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGAT 
ATTTGATTATCTTAAAAATTGTTAAATACCGTTTTCATGAAAGTTCTCAGTATTGTAACAGC 
AACTTGTCAAACCTAAGCATATTTGAATATGATCTCCCATAATTTGAAATTGAAATCGTATT 
GTGTGGAGGAAATGGCAATCTTATGTGTGCTGAAGGACACAGTAAGAGCACCAAGTTGTGCC 
CCACTTGC 



FIGURE 18 



ATGATTATTCTGTTACTTGTATTTATTGTTCAGTTTTATGGTATCTTGCGCTTGTTTAGCCC 
CTGAAACCAGGAGCAACAGGGNNCAGCTTCCTGGAGGTTGGTTGGCAACAATCACGGCCAAG 
TGACTCCGCAAATGACATCCCAGAGAAATCCTAAACTGCTGTGGGTTCCGAAGTGTTAACCC 
AAATGACACCTGTCTGGCTNGCTGTGTTAAAAGTGACCACTCGTGCTCGCCATGTGCTCCAA 
TCATAGGAGAATATGC 



FIGURE 19 



CAGTCACCATGAAGCTGGGCTGTGTCCTCATGGCCTGGGCCCTCTACCTTTCCCTTGGTGTG 
CTCTGGGTGGCCCAGATGCTACTGGCTGCCAGTTTTGAGACGCTGCAGTGTGAGGGACCTGT 
CTGCACTGAGGAGAGCAGCTGCCACACGGAGGATGACTTGACTGATGCAAGGGAAGCTGGCT 
TCCAGGTCAAGGCCTACACTTTCAGTGAACCCTTCCACCTGATTGTGTCCTATGACTGGCTG 
ATCCTCCAAGGTCCAGCCAAGCCAGTTTTTGAAGGGGACCTGCTGGTTCTGCGCTGCCAGGC 
CTGGCAAGACTGGCCACTGACTCAGGTGACCTTCTACCGAGATGGCTCAGCTCTGGGTCCCC 
CCGGGCCTAACAGGGAATTCTCCATCACCGTGGTACAAAAGGCAGACAGCGGGCACTACCAC 
TGCAGTGGCATCTTCCAGAGCCCTGGTCCTGGGATCCCAGAAACAGCATCTGTTGTGGCTAT 
CACAGTCCAAGAACTGTTTCCAGCGCCAATTCTCAGAGCTGTACCCTCAGCTGAACCCCAAG 
CAGGAAGC C C CATGACCCTGAGTTGTCAGACAAAGTTGCCCCTGCAGAGGTCAGCTGCCCGC 
CTCCTCTTCTCCTTCTACAAGGATGGAAGGATAGTGCAAAGCAGGGGGCTCTCCTCAGAATT 
CCAGATCCCCACAGCTTCAGAAGATCACTCCGGGTCATACTGGTGTGAGGCAGCCACTGAGG 
ACAACCAAGTTTGGAAACAGAGCCCCCAGCTAGAGATCAGAGTGCAGGGTGCTTCCAGCTCT 
GCTGCACCTCCCACATTGAATCCAGCTCCTCAGAAATCAGCTGCTCCAGGAACTGCTCCTGA 
GGAGGCCCCTGGGCCTCTGCCTCCGCCGCCAACCCCATCTTCTGAGGATCCAGGCTTTTCTT 
CTCCTCTGGGGATGCCAGATCCTCATCTGTATCACCAGATGGGCCTTCTTCTCAAACACATG 
CAGGATGTGAGAGTCCTCCTCGGTCACCTGCTCATGGAGTTGAGGGAATTATCTGGCCACCA 
GAAGCCTGGGACCACAAAGGCTACTGCTGA ATAGA AGTAAACAGTTCATCCATGATCTCACT 
TAACCACCCCAATAAATCTGATTCTTTATTTTCTCTTCCTGTCCTGCACATATGCATAAGTA 
CTTTTACAAGTTGTCCCAGTGTTTTGTTAGAATAATGTAGTTAGGTGAGTGTAAATAAATTT 
ATATAAAGTGAGAATTAGAGTTTAGCTATAATTGTGTATTCTCTCTTAACACAACAGAATTC 
TGCTGTCTAGATCAGGAATTTCTATCTGTTATATCGACCAGAATGTTGTGATTTAAAGAGAA 
CTAATGGAAGTGGATTGAATACAGCAGTCTCAACTGGGGGCAATTTTGCCCCCCAGAGGACA 
TTGGGCAATGTTTGGAGACATTTTGGTCATTATACTTGGGGGGTTGGGGGATGGTGGGATGT 
GTGTCTACTGGCATCCAGTAAATAGAAGCCAGGGGTGCCGCTAAACATCCTATAATGCACAG 
GGCAGTACCCCACAACGAAAAATAATCTGGCCCAAAATGTCAGTTGTACTGAGTTTGAGAAA 
CCCCAGCCTAATGAAACCCTAGGTGTTGGGCTCTGGAATGGGACTTTGTCCCTTCTAATTAT 
TATCTCTTTCCAGCCTCATTCAGCTATTCTTACTGACATACCAGTCTTTAGCTGGTGCTATG 
GTCTGTTCTTTAGTTCTAGTTTGTATCCCCTCAAAAGCCATTATGTTGAAATCCTAATCCCC 
AAGGTGATGGCATTAAGAAGTGGGCCTTTGGGAAGTGATTAGATCAGGAGTGCAGAGCCCTC 
ATGATTAGGATTAGTGCCCTTATTTAAAAAGGCCCCAGAGAGCTAACTCACCCTTCCACCAT 
ATGAGGACGTGGCAAGAAGATGACATGTATGAGAACCAAAAAACAGCTGTCGCCAAACACCG 
ACTCTGTCGTTGCCTTGATCTTGAACTTCCAGCCTCCAGAACTATGAGAAATAAAATTCTGG 
TTGTTTGTAGCCTAA 



FIGURE 2 0 



>< /us r / s eqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA4 0594 
xsubunit 1 of 1, 359 aa, 1 stop 
><MW: 38899, pi: 5.21, NX(S/T): 0 

MKLGCVLMAWALYLSLGVLWVAQMLLAASFETLQCEGPVCTEESSCHTEDDLTDAREAGFQV 
KAYTFSEPFHLIVSYDWLILQGPAKPVFEGDLLVLRCQAWQDWPLTQVTFYRDGSALGPPGP 
NREFSITWQKADSGHYHCSGIFQSPGPGIPETASWAITVQELFPAPILRAVPSAEPQAGS 
PMTLS CQTKLPLQRS AARLLFS FYKDGRI VQSRGLS SE FQ I PTAS EDHSGS YWCEAATEDNQ 
VWKQSPQLEIRVQGASSSAAPPTLNPAPQKSAAPGTAPEEAPGPLPPPPTPSSEDPGFSSPL 
GMPDPHLYHQMGLLLICHMQDVRVLLGHLLMELRELSGHQKPGTTKATAE 

Signal sequence : 

amino acids 1-17 

Leucine zipper pattern sequence: 

amino acids 12-33 

Protein kinase C phosphorylation site: 

amino acids 353-355 



FIGURE 21 

CCCACGCGTCCGCCCACGCGTCCGCCCACGGGTCCGCCCACGCGTCCGGGCCACCAGAAGTT 
TGAGCCTCTTTGGTAGCAGGAGGCTGGAAGAAAGGACAGAAGTAGCTCTGGCTGTGATGGGG 
ATCTTACTGGGCCTGCTACTCCTGGGGCACCTAACAGTGGACACTTATGGCCGTCCCATCCT 
GGAAGTGCCAGAGAGTGTAACAGGACCTTGGAAAGGGGATGTGAATCTTCCCTGCACCTATG 
ACCCCCTGCAAGGCTACACCCAAGTCTTGGTGAAGTGGCTGGTACAACGTGGCTCAGACCCT 
GTCACCATCTTTCTACGTGACTCTTCTGGAGACCATATCCAGCAGGCAAAGTACCAGGGCCG 
CCTGCATGTGAGCCACAAGGTTCCAGGAGATGTATCCCTCCAATTGAGCACCCTGGAGATGG 
ATGACCGGAGCCACTACACGTGTGAAGTCACCTGGCAGACTCCTGATGGCAACCAAGTCGTG 
AGAGATAAGATTACTGAGCTCCGTGTCCAGAAACTCTCTGTCTCCAAGCCCACAGTGACAAC 
TGGCAGCGGTTATGGCTTCACGGTGCCCCAGGGAATGAGGATTAGCCTTCAATGCCAGGCTC 
GGGGTTCTCCTCCCATCAGTTATATTTGGTATAAGCAACAGACTAATAACCAGGAACCCATC 
AAAGTAGCAACCCTAAGTACCTTACTCTTCAAGCCTGCGGTGATAGCCGACTCAGGCTCCTA 
TTTCTGCACTGCCAAGGGCCAGGTTGGCTCTGAGCAGCACAGCGACATTGTGAAGTTTGTGG 
TCAAAGACTCCTCAAAGCTACTCAAGACCAAGACTGAGGCACCTACAACCATGACATACCCC 
TTGAAAGCAACATCTACAGTGAAGCAGTCCTGGGACTGGACCACTGACATGGATGGCTACCT 
TGGAGAGACCAGTGCTGGGCCAGGAAAGAGCCTGCCTGTCTTTGCCATCATCCTCATCATCT 
CCTTGTGCTGTATGGTGGTTTTTACCATGGCCTATATCATGCTCTGTCGGAAGACATCCCAA 
CAAGAGCATGTCTACGAAGCAGCCAGGTAAGAAAGTCTCTCCTCTTCCATTTTTGACCCCGT 
CCCTGCCCTCAATTTTGATTACTGGCAGGAAATGTGGAGGAAGGGGGGTGTGGCACAGACCC 
AATCCTAAGGCCGGAGGCCTTCAGGGTCAGGACATAGCTGCCTTCCCTCTCTCAGGCACCTT 
CTGAGGTTGTTTTGGCCCTCTGAACACAAAGGATAATTTAGATCCATCTGCCTTCTGCTTCC 
AGAATCCCTGGGTGGTAGGATCCTGATAATTAATTGGCAAGAATTGAGGCAGAAGGGTGGGA 
AACCAGGACCACAGCCCCAAGTCCCTTCTTATGGGTGGTGGGCTCTTGGGCCATAGGGCACA 
TGCCAGAGAGGCCAACGACTCTGGAGAAACCATGAGGGTGGCCATCTTCGCAAGTGGCTGCT 
CCAGTGATGAGCCAACTTCCCAGAATCTGGGCAACAACTACTCTGATGAGCCCTGCATAGGA 
CAGGAGTACCAGATCATCGCCCAGATCAATGGCAACTACGCCCGCCTGCTGGACACAGTTCC 
TCTGGATTATGAGTTTCTGGCCACTGAGGGCAAAAGTGTCTGTTAAAAATGCCCCATTAGGC 
CAGGATCTGCTGACATAATTGCCTAGTCAGTCCTTGCCTTCTGCATGGCCTTCTTCCCTGCT 
ACCTCTCTTCCTGGATAGCCCAAAGTGTCCGCCTACCAACACTGGAGCCGCTGGGAGTCACT 
GGCTTTGCCCTGGAATTTGCCAGATGCATCTCAAGTAAGCCAGCTGCTGGATTTGGCTCTGG 
GCCCTTCTAGTATCTCTGCCGGGGGCTTCTGGTACTCCTCTCTAAATACCAGAGGGAAGATG 
CCCATAGCACTAGGACTTGGTCATCATGCCTACAGACACTATTCAACTTTGGCATCTTGCCA 
CCAGAAGACCCGAGGGAGGCTCAGCTCTGCCAGCTCAGAGGACCAGCTATATCCAGGATCAT 
TTCTCTTTCTTCAGGGCCAGACAGCTTTTAATTGAAATTGTTATTTCACAGGCCAGGGTTCA 
GTTCTGCTCCTCCACTATAAGTCTAATGTTCTGACTCTCTCCTGGTGCTCAATAAATATCTA 
ATCATAACAGC 



FIGURE 22 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA45416 
xsubunit 1 of 1, 321 aa, 1 stop 
><MW: 35544, pi: 8.51, NX(S/T): 0 

MGILLGLLLLGHLTVDTYGRPILEVPESVTGPWKGDVNLPCTYDPLQGYTQVLVKWLVQRGS 
DPVTIFLRDSSGDHIQQAKYQGRLHVSHKVPGDVSLQLSTLEMDDRSHYTCEVTWQTPDGNQ 
WRDKITELRVQKLSVSKPTVTTGSGYGFTVPQGMRI SLQCQARGSPPI SYIWYKQQTNWQE 
PIKVATLSTLLFKPAVIADSGSYFCTAKGQVGSEQHSDIVKFWKDSSKLLKTKTEAPTTMT 
YPLKATSTVKQSWDWTTDMDGYLGETSAGPGKSLPVFAIILIISLCCMWFTMAYIMLCRKT 
SQQEHVYEAAR 

Signal Sequence: 

amino acids 1-19 

Glycosaminoglycan attachment site: 

amino acids 149-152 

Transmembrane domain: 

amino acids 282-300 



FIGURE 23 



GCGCCGGGAGCCCATCTGCCCCCAGGGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCACAT 
GGCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGTCGGA 
GGCGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCGGGGATC 
GGGATGTCCCTCCTCCTTCTCCTCTTGCTAGTTTCCTACTATGTTGGAACCTTGGGGACTCA 
CACTGAGATCAAGAGAGTGGCAGAGGAAAAGGTCACTTTGCCCTGCCACCATCAACTGGGGC 
TTCCAGAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAAGGGAACCAAAAA 
GTGGTGATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACTGAGGAACAGAAGGGCCG 
AGTGGCCTTTGCTTCCAATTTCCTGGCAGGAGATGCCTCCTTGCAGATTGAACCTCTGAAGC 
CCAGTGATGAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTGTGGAGCCAT 
GTCATCTTAAAAGTCTTAGTGAGACCATCCAAGCCCAAGTGTGAGTTGGAAGGAGAGCTGAC 
AGAAGGAAGTGACCTGACTTTGCAGTGTGAGTCATCCTCTGGCACAGAGCCCATTGTGTATT 
ACTGGCAGCGAATCCGAGAGAAAGAGGGAGAGGATGAACGTCTGCCTCCCAAATCTAGGATT 
GACTACAACCACCCTGGACGAGTTCTGCTGCAGAATCTTACCATGTCCTACTCTGGACTGTA 
CCAGTGCACAGCAGGCAACGAAGCTGGGAAGGAAAGCTGTGTGGTGCGAGTAACTGTACAGT 
ATGTACAAAGCATCGGCATGGTTGCAGGAGCAGTGACAGGCATAGTGGCTGGAGCCCTGCTG 
ATTTTCCTCTTGGTGTGGCTGCTAATCCGAAGGAAAGACAAAGAAAGATATGAGGAAGAAGA 
GAGACCTAATGAAATTCGAGAAGATGCTGAAGCTCCAAAAGCCCGTCTTGTGAAACCCAGCT 
CCTCTTCCTCAGGCTCTCGGAGCTCACGCTCTGGTTCTTCCTCCACTCGCTCCACAGCAAAT 
AGTGCCTCACGCAGCCAGCGGACACTGTCAACTGACGCAGCACCCCAGCCAGGGCTGGCCAC 
CCAGGCATACAGCCTAGTGGGGCCAGAGGTGAGAGGTTCTGAACCAAAGAAAGTCCACCATG 
CTAATCTGACCAAAGCAGAAACCACACCCAGCATGATCCCCAGCCAGAGCAGAGCCTTCCAA 
ACGGTCTGAATTACAATGGACTTGACTCCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTC 
TTCTCGTCATTGGAGCTCAAGTCACCAGCCACACAACCAGATGAGAGGTCATCTAAGTAGCA 
GTGAGCATTGCACGGAACAGATTCAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAA 
AGGATGTAAGCTGATTCATCTGTAAAAAGGCATCTTATTGTGCCTTTAGACCAGAGTAAGGG 
AAAGCAGGAGTCCAAATCTATTTGTTGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTG 
AGGTGAATATACCTAAAACTTTTAATGTGGGATATTTTGTATCAGTGCTTTGATTCACAATT 
TTCAAGAGGAAATGGGATGCTGTTTGTAAATTTTCTATGCATTTCTGCAAACTTATTGGATT 
ATTAGTTATTCAGACAGTCAAGCAGAACCCACAGCCTTATTACACCTGTCTACACCATGTAC 
TGAGCTAACCACTTCTAAGAAACTCCAAAAAAGGAAACATGTGTCTTCTATTCTGACTTAAC 
TTCATTTGTCATAAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATAGTGGGAGATGGAGA 
AGAGTGAATGAGTTTCTCCCACTCTATACTAATCTCACTATTTGTATTGAGCCCAAAATAAC 
TATGAAAGGAGACAAAAATTTGTGACAAAGGATTGTGAAGAGCTTTCCATCTTCATGATGTT 
ATGAGGATTGTTGACAAACATTAGAAATATATAATGGAGCAATTGTGGATTTCCCCTCAAAT 
CAGATGCCTCTAAGGACTTTCCTGCTAGATATTTCTGGAAGGAGAAAATACAACATGTCATT 
TATCAACGTCCTTAGAAAGAATTCTTCTAGAGAAAAAGGGATCTAGGAATGCTGAAAGATTA 
CCCAACATACCATTATAGTCTCTTCTTTCTGAGAAAATGTGAAACCAGAATTGCAAGACTGG 
GTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTAATATGTCAAGGAAGGTAGCCGGGCA 
TGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAGGTTGCAGTGAGCCGAGATTATGCC 
ATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 



FIGURE 24 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45419 
xsubunit 1 of 1, 373 aa, 1 stop 
><MW: 41281, pi: 8.33, NX(S/T): 3 

MSLLLLLLLVSYYVGTLGTHTEIKRVAEEKYTLPCHHQLGLPEKDTLDIEWLLTDNEGNQKV 
VITYSSRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVKNSGRYVWSHV 
ILKVLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLPPKSRID 
YNHPGRVLLQNLTMSYSGLyQCTAGNEAGKESCWRVTVQYVQSIGMVAGAVTGIVAGALLI 
FLLVWLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSGSSSTRSTANS 
ASRSQRTLSTDAAPQPGIATQAYSLVGPEWGSEPKKVHHANLTKAETTPSMIPSQSRAFQTV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 232-251 



FIGURE 25 



GTCGTTCCTTTGOTCTCTCGCGCCCaGTCCTCCTCCCTGGTTCTCCTCAGCCGCTGTCGGAGGAGAGCACCCGGA 
GACGCGGGCTGCAGTCGCGGCGGCTTCTCCCCGCCTGGGCGGCCTCGCCGCTGGGCAGGTGCTGAGCGCCCCTAG 
AGCCTCCCTTGCCGCCTCCCTCCTCTGCCCGGCCGCAGCAGTGCACATGGGGTGTTGGAGGTAGATGGGCTCCCG 
GCCCGGGAGGCGGCGGTGGATGCGGCGCTGGGCAGAAGCAGCCGCCGATTCCAGCTGCCCCGCGCGCCCCGGGCG 
CCCCTGCGAGTCCCCGGTTCAGCC^TGGGGACCTCTCCGAGCAGCAGCACCGCCCTCGCCTCCTGCAGCCGCATC 
GCCCGCCGAGCC^C^GCCACGATGATCGCGGGCTCCCTTCTCCTGCTT^^ 

CCAGAACAGAAGGCCTCGAATCTCATTGGCACATACCGCCATGTTGACCGTGCCACCGGCCAGGTGCTAACCTGT 
GACAAGTGTCCAGCAGGAACCTATGTCTCTGAGCATTGTACCAAt^CAAGCCTGCGCGTCTGCAGCAGTTGCCCT 
GTGGGGACCTTTACGAGGCATGAGAATGGCATAGAGAAATGCCATGACTGTAGTCAGCCATGCCCATGGCCAATG 
ATTGAGAAATTACCTTGTGCTGCCTTGACTGACCGAGAATGCACTTGCCCACCTGGCATGTTCCAGTCTAACGCT 
ACCTGTGCCCCCCATACGGTGTGTCCTGTGGGTTGGGGTGTGCGGAAGAAAGGGACAGAGACTGAGGATGTGCGG 
TGTAAGCAGTGTGCTCGGGGTACCTTCTCAGATGTGCCTTCTAGTGTGATGAAATGCAAAGCATACACAGACTGT 
CTGAGTCAGAACCTGGTGGTGATCAAGCCGGGGACC^GGAGACAGACAACGTCTGTGGCACACTCCCGTCCTTC 
TCCAGCTCCACCTCACCTTCCCCTGGCACAGCCATCTTTCCACGCCCTGAGCACATGGAAACCCATGAAGTCCCT 
TCCTCCACTTATGTTCCCS^GGCATGAACTCAACAGAATCCAACTCTTCTGCCTCTGTTAGACCAAAGGTACTG 
AGTAGCATCCAGGAAGGGACAGTCCCTGACAACACAAGCTCAGCAAGGGGGAAGGAAGACGTGAACAAGACCCTC 
CCAAACCTTCAGGTAGTCAACCACCAGCAAGGCCCCCACCACAGACACATCCTGAAGCTGCTGCCGTCCATGGAG 
GCCACTGGGGGCGAGAAGTCCAGCACGCCCATCAAGGGCCCCAAGAGGGGACATCCTAGACAGAACCTACACAAG 
CATTTTGACATCAATGAGCATTTGCCCTGGATGATTGTGCTTTTCCTGCTGCTGGTGCTTGTGGTGATTGTGGTG 
TGCAGTATCCGGAAAAGCTCGAGGACTCTGAAAAAGGGGCCCCGGCAGGATCCCAGTGCCATTGTGGAAAAGGCA 
GGGCTGAAGAAATCCATGACTCCAACCCAGAACCGGGAGAAATGGATCTACTACTGCAATGGCCATGGTATCGAT 
ATCCTGAAGCTTGTAGCAGCCC!AAGTGGGAAGCCAGTGGAAAGATATCTATCAGTTTCTTTGCAATGCCAGTGAG 
AGGGAGGTTGCTGCTTTCTCCAATGGGTACACAGCCGACCACGAGCGGGCCTACGCAGCTCTGCAGCACTGGACC 
ATCCGGGGCCCCGAGGCCAGCCTCGCCCAGCTAATTAGCGCCCTGCGCCAGCACCGGAGAAACGATGTTGTGGAG 
AAGATTCGTGGGCTGATGGAAGACACCACCCAGCTGGAAACTGACAAACTAGCTCTCCCGATGAGCCCCAGCCCG 
CTTAGCCCGAGCCCCATCCCCAGCCCCAACGCGAAACTTGAGAATTCCGCTCTCCTGACGGTGGAGCCTTCCCCA 
CAGGACAAGAACAAGGGCTTCTTCGTGGATGAGTCGGAGCCCCTTCTCCGCTGTGACTCTACATCCAGCGGCTCC 
TCCGCGCTGAGCAGGAACGGTTCCTTTATTACCAAAGAAAAGAAGGACACAGTGTTGCGGCAGGTACGCCTGGAC 
CCCTGTGACTTGCAGCCTATCTTTGATGACATGCTCCACTTTCTAAATCCTGAGGAGCTGCGGGTGATTGAAGAG 
ATTCCCCAGGCTGAGGACAAA.CTAGACCGGCTATTCGAAATTATTGGAGTCAAGAGCCAGGAAGCCAGCCAGACC 
CTCCTGGACTCTGTTTATAGCCATCTTCCTGACCTGCT GTAGA ACATAGGGATACTGCATTCTGGAAATTACTCA 
ATTTAGTGGCAGGGTGGTTTTTTAATTTTCTTCTGTTTCTGATTTTTGTTGTTTGGGGTGTGTGTGTGTGTTTGT 

TCTCTCTCTTTTTTTTTTAAATAACTCTTCTGGGAAGTTGGTTTATAAGCCTTTGCCAGGTGTAACTGTTGTGAA 
ATACCCACCACTAAAGTTTTTTAAGTTCCATATTTTCTCCATTTTGCCTTCTTATGTATTTTCAAGATTATTCTG 
TGCACTTTAAATTTACTTAACTTACCATAAATGCAGTGTGACTTTTCCCACACACTGGATTGTGAGGCTCTTAAC 
TTCTTAAAAGTATAATGGCATCTTGTGAATCCTATAAGCAGTCTTTATGTCTCTTAACATTCACACCTACTTTTT 
AAAAACAAATATTATTACTATTTTTATTATTGTTTG^ 

CCCCATTGAGTTACTGTAATGCAATTCAACTTTGAGTTATCTTTTAAATATGTCTTGTATAGTTCATATTCATGG 
CTGAAACTTGACCACACTATTGCTGATTGTATGGTTTTCACCTGGACACCGTGTAGAATGCTTGATTACTTGTAC 
TCTTCTTATGCTAATATGCTCTGGGCTGGAGAAATGAAATCCTCAA.GCCATCAGGATTTGCTATTTAAGTGGCTT 
GACAACTGGGCCACCAAAGAACrrGAACTTCACCTTTTAGGATTTGAGCTGTTCTGGAACACATTGCTGCACTTT 
GGAAAGTCAAAATCAAGTGCCAGTGGCGCCCTTTCCATAGAGAATTTGCCCAGCTTTGCTTTAAAAGATGTCTTG 
TTTTTTATATACACATAATCAATAGGTCCAATCTGCTCTCAAGGCCTTGGTCCTGGTGGGATTCCTTCACCAATT 
ACTTTAATTAAAAATGGCTGCAACTGTAAGAACCCTTGTCTGATATATTTGCAACTATGCTCCCATTTACAAATG 
TACCTTCTAATGCTCAGTTGCCAGGTTCCZ^TGCAAAGGTGGCGTGGACTCCCTTTGTGTGGGTGGGGTTTGTGG 
GTAGTGGTGAAGGACCGATATCAGAAAAATGCCTTCAAGTGTACTAATTTATTAATAAACATTAGGTGTTTGTTA 
AAAAAAAAA 



FIGURE 26 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA52594 
xsubunit 1 of 1, 655 aa, 1 stop 
><MW: 71845, pi: 8.22, NX(S/T): 8 

MGTSPSSSTALASCSRIARRATATMIAGSLLLLGFLSTTTAQPEQKASNLIGTYRHVDRATG 
QVLTCDKCPAGTYVSEHCTNTSLRVCSSCPVGTFTRHENGI EKCHDCSQPCPWPMI EKLPCA 
ALTDRECTCPPGMFQSNATCAPHTVCPVGWGVRKKGTETEDVRCKQCARGTFSDVPSSVMKC 
KAYTDCLSQNLWIKPGTKETDNVCGTLPSFSSSTSPSPGTAIFPRPEHMETHEVPSSTYVP 
KGMNSTESNSSASVRPKVLSSIQEGTVPDNTSSARGKEDWICTLPNLQVWHQQGPHHRHIL 
KLLPSMEATGGEKSSTPIKGPKRGHPRQNLHKHFDINEHLPWMIVLFLLLVLWIWCSIRK 
SSRTLKKGPRQDPSAIVEKAGLKKSMTPTQNREKWIYYCNGHGIDILKLVAAQVGSQWKDIY 
QFLCNAS ERE VAAFSNGYTADHERAYAALQHWTI RGPEAS LAQL I S ALRQHRRND WEKI RG 
LMEDTTQLETDKLALPMS PS PLS PSPI PS PNAKLENS ALLTVEPS PQDKNKGFFVDE SE PLL 
RCDSTSSGSSALSRNGSFITICEKKDTVLRQVRLDPCDLQPIFDDMLHFLNPEELRVIEEIPQ 
AEDKLDRLFE I IGVKSQEASQTLLDSVYSHLPDLL 

Signal sequence: 

amino acids 1-41 

Transmembrane domain: 

amino acids 350-370 



FIGURE 27 



ATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGA 
CTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCAC 
CATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCGCTTTTTGGCC 
TTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAGATGCTGTTGCTGCACAGATCCTG 
TCACTGCTGCCATTGAAGTTTTTTCCAATCATCGTCATTGGGATCATTGCATTGATATTAGC 
ACTGGCCATTGGTCTGGGCATCCACTTCGACTGCTCAGGGAAGTACAGATGTCGCTCATCCT 
TTAAGTGTATCGAGCTGATAGCTCGATGTGACGGAGTCTCGGATTGCAAAGACGGGGAGGAC 
GAGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCTTC 
GTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCCCAAC 
TGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGGCAGTTC 
CGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCATTACACCA 
CTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAGTGCACAGCCT 
GTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCCTTGCTCTCGCAG 
TGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGGGGCTCTGTCATCAC 
GCCCCTGTGGATCATCACTGCTGCACACTGTGTTTATGACTTGTACCTCCCCAAGTCATGGA 
CCATCCAGGTGGGTCTAGTTTCCCTGTTGGACAATCCAGCCCCATCCCACTTGGTGGAGAAG 
ATTGTCTACCACAGCAAGTACAAGCCAAAGAGGCTGGGCAATGACATCGCCCTTATGAAGCT 
GGCCGGGCCACTCACGTTCAATGAAATGATCCAGCCTGTGTGCCTGCCCAACTCTGAAGAGA 
ACTTCCCCGATGGAAAAGTGTGCTGGACGTCAGGATGGGGGGCCACAGAGGATGGAGGTGAC 
GCCTCCCCTGTCCTGAACCACGCGGCCGTCCCTTTGATTTCCAACAAGATCTGCAACCACAG 
GGACGTGTACGGTGGCATCATCTCCCCCTCCATGCTCTGCGCGGGCTACCTGACGGGTGGCG 
TGGACAGCTGCCAGGGGGACAGCGGGGGGCCCCTGGTGTGTCAAGAGAGGAGGCTGTGGAAG 
TTAGTGGGAGCGACCAGCTTTGGCATCGGCTGCGCAGAGGTGAACAAGCCTGGGGTGTACAC 
CCGTGTCACCTCCTTCCTGGACTGGATCCACGAGCAGATGGAGAGAGACCTAAAAACC TGAA 
GAGGAAGGGGACAAGTAGCCACCTGAGTTCCTGAGGTGATGAAGACAGCCCGATCCTCCCCT 
GGACTCCCGTGTAGGAACCTGCACACGAGCAGACACCCTTGGAGCTCTGAGTTCCGGCACCA 
GTAGCAGGCCCGAAAGAGGCACCCTTCCATCTGATTCCAGCACAACCTTCAAGCTGCTTTTT 

TCCCTGCTCACTGCAGCCTCCGCTTCCCTGGTTCAAGCGATTCTCTTGCCTCAGCTTCCCCA 
GTAGCTGGGACCACAGGTGCCCGCCACCACACCCAACTAATTTTTGTATTTTTAGTAGAGAC 
AGGGTTTCACCATGTTGGCCAGGCTGCTCTCAAACCCCTGACCTCAAATGATGTGCCTGCTT 
CAGCCTCCCACAGTGCTGGGATTACAGGCATGGGCCACCACGCCTAGCCTCACGCTCCTTTC 
TGATCTTCACTAAGAACAAAAGAAGCAGCAACTTGCAAGGGCGGCCTTTCCCACTGGTCCAT 
CTGGTTTTCTCTCCAGGGTCTTGCAAAATTCCTGACGAGATAAGCAGTTATGTGACCTCACG 
TGCAAAGCCACCAACAGCCACTCAGAAAAGACGCACCAGCCCAGAAGTGCAGAACTGCAGTC 
ACTGCACGTTTTCATCTCTAGGGACCAGAACCAAACCCACCCTTTCTACTTCCAAGACTTAT 
TTTCACATGTGGGGAGGTTAATCTAGGAATGACTCGTTTAAGGCCTATTTTCATGATTTCTT 
TGTAGCATTTGGTGCTTGACGTATTATTGTCCTTTGATTCCAAATAATATGTTTCCTTCCCT 
CATTGTCTGGCGTGTCTGCGTGGACTGGTGACGTGAATCAAAATCATCGACTGAAA 



FIGURE 28 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45234 
xsubunit 1 of 1, 453 aa, 1 stop 
><MW: 49334, pi: 6.32, NX(S/T): 1 

MGENDPPAVEAPFS FRS LFGLDDLKI S PVAPDADAVAAQ I LSLLPLKPFPI I VI GI I AL I LA 
LAIGLGIHFDCSGKYRCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVFTAAS 
WKTMCSDDWKGHYANVACAQLGFPSYVSSDNLRVSSLEGQFREEFVSIDHLLPDDKVTALHH 
SVYVREGCASGHWTLQCTACGHRRGYSSRIVGGNMSLLSQWPWQASLQFQGYHLCGGSVIT 
PLWIITAAHCVYDLYLPKSWTIQVGLVSLLDNPAPSHLVEKIVYHSKYKPKRLGNDIALMKL 
AGPLTFNEMI QPVCLPNSEENFPDGKVCWTSGWGATEDGGDASPVLNHAAVPL I SNKI CNHR 
DVYGGI I S PSMLCAGYLTGGVDSCQGDSGGPLVCQERRLWKLVGATS FGIGCAEVNKPGVYT 
RVTS FLDW IHEQMERDLKT 

Signal Peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 240-284 



FIGURE 29 

CCCACGCGTCCGTCCTAGTCCCCGGGCCAACTCGGACAGTTTGCT^ 

GCCAGAACGGCGCGCGCGCGCGCACGCACGCAGA.CACACGGGGGGAAACTTTTTTAAAAATGAAAGGCTAGAAGA 
GCTCAGCGGCGGCGCGGGCGCTGCGCX^GGGCTCCGGAGCTGACTCGCCGAGGCAGGAAATCCCTCCGGTCGCGA 
CGCCCGGCCCCGGCTCGGCGCCCGCGTGGGATGGTGCAGCGCTCGCCGCCGGGCCCGAGAGCTGCTGCACTGAAG 
GCCGGCGACGATGGCAGCGCGCCCGCTGCCCGTGTCCCCCGCCCGCGCCCTCCTGCTCGCCCTGGCCGGTGCTCT 
GCTCGCGCCCTGCGAGGCCCGAGGGGTGAGCTTATGGAACCAAGGAAGAGCTGATGAAGTTGTCAGTGCCTCTGT 
TCGGAGTGGGGACCTCTGGATCCCAGTGAAGAGCTTCGACTCCAAGAATCATCCAGAAGTGCTGAATATTCGACT 
ACAACGGGAAAGCAAAGAACTGATCATAAATCTGGAAAGAAATGAAGGTCTCATTGCCAGCAGTTTCIACGGAAAC 
CCACTATCTGCAAGACGGTACTGATGTCTCCCTCGCTCGAAATTACACGGGTCACTGTTACTACCATGGACATGT 
ACGGGGATATTCTGATTCAGGAGTCAGTCTCAGCACGTGTTCTGGTCTCAGGGGACTTATTGTGTTTGAAAATGA 
AAGCTATGTCTTAGAACC^TGAAAAGTGCAACCAACaGATACAAACTCTTCCC^GCGAAGAAGCTGAAAAGCGT 
CCGGGGATC&TGTGGATCACATCACAACAC^ 

ATGGGCAAGAAGGCATAAAAGAGAGACCCTCAAGGCAACTAAGTATGTGGAGCTGGTGATCGTGGCAGACAACCG 
AGAGTTTCAGAGGCAAGGAAAAGATCTGGAAAAAGTTAAGCAGCGATTAATAGAGATTGCTAATCACGTTGACAA 
GTTTTACAGACCACTGAACATTCGGATCGTGTTGGTAGGCGTGGAAGTGTGGAATGACATGGACAAATGCTCTGT 
AAGTCAGGACCCATTCACCAGCCTCCATGAATTTCTGGACTGGAGGAAGATGAAGCTTCTACCTCGCAAATCCCA 
TGACAATGCGCAGCTTGTCAGTGGGGTTTATTTCCAAGGGACCACCATCG^ 

CACGGCAGACCAGTCTGGGGGAATTGTCATGGACCATTCAGACAATCCCCTTGGTGCAGCCGTGACCCTGGCACA 
TGAGCTGGGCCACAATTTCGGGATGAATCATGACACACTGGACAGGGGCTGTAGCTGTCAAATGGCGGTTGAGAA 
AGGAGGCTGCATCATGAACGCTTCCACCGGGTACCCATTTCCCATGGTGTTCAGCAGTTGCAGCAGGAAGGACTT 
GGAGACCAGCCTGGAGAAAGGAATGGGGGTGTGCCTGTTTAACCTGCCGGAAGTCAGGGAGTCTTTCGGGGGCCA 
GAAGTGTGGGAACAGATTTGTGGAAGAAGGAGAGGAGTGTGACTGTGGGGAGCCAGAGGAATGTATGAATCGCTG 
CTGCAATGCCACCACCTGTACCCTGAAGCCGGACGCTGTGTGCGCACATGGGCTGTGCTGTGAAGACTGCCAGCT 
GAAGCCTGCAGGAACAGCGTGCAGGGACTCCAGCAACTCCTGTGACCTCCCAGAGTTCTGCACAGGGGCCAGCCC 
TCACTGCCCAGCCAATGTGTACCTGCACGATGGGCACTCATGT^ 

CTGCCAGACTCACGAGC^GCAGTGTGTCACGCTCTGGGGACCAGGTGCTAAACCTGCCCCTGGGATCTGCTTTGA 

GAGAGTCAATTCTGCAGGTGATCCTTATGGCAA.CTGTGGCAAA.GTCTCGAAGAGTTCCTTTGCCAAATGCGAGAT 

GAGAGATGCTAAATGTGGAAAAATCCAGTGTGAAGGAGGTGCCAGCCGGCCAGTCATTGGTACCAATGCCGTTTC 

CATAGAAACAAACATCCCTCTGCAGCAAGGAGGCCGGATTCTGTGCCGGGGGACCCACGTGTACTTGGGCGATGA 

CATGCCGGACCCAGGGCTTGTGCTTGC^GGCACAAAGTGTGCAGATGGAAAAATCTGCCTGAATCGTCAATGTCA 

AAATATTAGTGTCTTTGGGGTTCACGAGTGTGCAATGCAGTGCCACGGCAGAGGGGTGTGCAACAACAGGAAGAA 

CTGCCACTGCGAGGCCCACTGGGCACCTCCCTTCTGTGACAAGTTTGGCTTTGGAGGAAGCACAGACAGCGGCCC 

CATCCGGCAAGCAGAAGCAAGGCAGGAAGCTGCAGAGTCCAACAGGGAGCGCGGCCAGGGCCA.GGAGCC 

ATCGCAGGAGCATGCGTCTACTGCCTCACTGACACTCATCTGAGCCCTCCCATGACATGGAGACCGTGACCAGTG 

CTGCTGCAGAGGAGGTCACGCGTCCCCAAGGCCTCCTGTGACTGGCAGCATTGACTCTGTGGCTTTGCCATCGTT 

TCCATGACAACAGACACAACACAGTTCTCGGGGCTCAGGAGGGGAAGTCC^ 

CAGTGC^^GGAAGGGCAGCGACTTCCTGGTTGAGCTTCTGCTAAAACATGGACATGCTTCAGTGCTGCTCCTGAG 

AGAGTAGCAGGTTACCACTCTGGCAGGCCCCAGCCCTGCAGCAAGGAGGAAGAGGACTCAAAAGTCTGGCCTTTC 

ACTGAGCCTCCACAGCAGTGGGGGAGAAGCAAGGGTTGGGCCCAGTGTCCCCTTTCCCCAGTGACACCTCAGCCT 

TGGCAGCCCTGATGACTGGTCTCTGGCTGCAACTTAATGCTCTGATATGGCTTTTAGCATTTATTATATGAAAAT 

AGCAGGGTTTTAGTTTTTAATTTATCAGAGACCCTGCCACCCATTCCATCTCCATCCAAGCAAACTGAATGGC^ 

TGAAACAAACTGGAGAAGAAGGTAGGAGAAAGGGCGGTGAACTCTGGCTCTTTGCTGTGGACATGCGTGACCAGC 

AGTACTC^GGTTTGAGGGTTTGCAGAAAGCCAGGGAACCCACAGAGTCACCAACCCTTCATTTAACAAGTAAG 

TGTTAAAAAGTGAAAACAATGTAAGAGCCTAACTCCATCCCCCGTGGCCATTACTGCATAAAATAGAGTGCATTT 

GAAAT 



FIGURE 3 0 



></usr/seqcib2/sst/DNA/Dnaseqs.min/ss.DNA49624 
xsubunit 1 of 1, 735 aa, 1 stop 
><MW: 80177, pi: 7.08, NX(S/T): 5 

MAARPLPVSPARALLLALAGALLAPCEARGVSLWNQGRADEWSASVRSGDLWIPVKSFDSK 

NHPEVLNIRLQRESKELIINLERNEGLIASSFTETHYLQDGTDVSLARNYTGHCYYHGHVRG 

YSDSAVSLSTCSGLRGLIVFENESY^7'LEPMKSATNRYKLFPAKKLKSVRGSCGSHHNTPNIjA 

AKNVFPPPSQTWARRHKRETLKATKYVELVI VADNREFQRQGKDLEKVKQRL I E I ANHVDKF 

YRPIJSriRIVLVGVEVP^MDKCSVSQDPFTSLHEFLDWRKMKLLPRKSHDNAQLVSGVYFQG 

TTIGMAPIMSMCTADQSGGIVMDHSDNPLGAAVTLAHELGHNFGMNHDTLDRGCSCQmVE 

GGCIMNASTGYPFPMVFSSCSRKDLETSLEKGMGVCLFNLPEVRESFGGQKCGNRFVEEGEE 

CDCGEPEECMNRCCNATTCTIiKPDAVGAHGLCCEDCQIiKPAGTACRDSSNSCDIjPEFCTGAS 

PHCPANVYLHDGHSCQDVDGYCYNGICQTHEQQCVTLWGPGAKPAPGICFERVNSAGDPYGN 

CGKVSKSSFAKCEMRDAKCGKIQCQGGASRPVIGTNAVSIETNIPLQQGGRILCRGTHVYLG 

DDMPDPGLVLAGTKCADGKICLNRQCQNISVFGVHEC 

FCDKFGFGGSTDSGPIRQAEARQEAAESNRERGQGQEPVGSQEHASTASLTLI 

Signal peptide: 

amino acids 1-28 



FIGURE 31 



TCCCAAGGCTTCTTGGATGGCAGATGATTNTGGGGTTTTGCATTGTTTCCCTGACAACGAAA 
ACAAAACAGTTTTGGGGGTTCAGGAGGGGAANTCCAGCCTACCCAGGAAGTTTGCAGAAACA 
GTGCAAGGAAGGGCAGGANTTCCTGGTTGAGNTTTTTGNTAAAACATGGACATGNTTCAGTG 
CTGCTCNTGAGAGAGTAGCAGGTTACCACTTTTGGCAGGCCCCAGCCCTGCAGCAAGGAGGA 
AGAGGACTCAAAAGTTTGGCCTTTCACTGAGCCTCCACAGCAGTGGGGGAGAAGCAAGGGTT 
GGGCCCAGTGTCCCCTTTCCCCAGTGACACCTCAGCCTTGGCAGCCCTGATAACTGGTNTNT 
GGCTGCAANTTAATGCTNTGATATGGCTTTTAGCATTTATTATATGAAAATAGCAGGGTTTT 
AGTTTTTAATTTATCAGAGACCCTGCCACCCATTCCATNTCCATCCAAG 



FIGURE 32 



CATCCTGCAACATGGTGAAACCACGCCTGGCTAATTTTGTTGTATTTTTGGTAGAGATGGGA 
TTTCACCGTGTTAGCCAGGATTGTCTCAATCTGACCTCATGATCTGCCCGCCTCGGCCTCCC 
AAAGTGCTGGGATTACAGGCGAGTGCAACCACACCCGGCCACAAACTTTTTAAGAAGTTAAT 
GAAACCATACCTTTTACATTTTTAATGACAGGAAAATGCTGAGAATAATTGTTAACCCAAAA 
TTCTGGATACAAAAGTACAATCTTTACTGTGTAAATACATGTATATGTACTATATGAAAATA 
TACCAAATATCAATAATACTTATCTCTGGGTAAAAACCTCTTCTCATACCCTGTGCTAACAA 
CTTTTAACAAAAAATTTGCATCACTTTTAAGAATCAAGAAAAATTTCTGAAGGTCATATGGG 
ACAGAAAAAAAAACCAAGGGAAAAATCACGCCACTTGGGAAAAAAAGATTCGAAATCTGCCT 
TTTTATAGATTTGTAATTAATAAGGTCCAGGCTTTCTAAGCAACTTAAATGTTTTGTTTCGA 
AACAAAGTACTTGTCTGGATGTAGGAGGAAAGGGAGTGATGTCACTGCCATTATGATGCCCC 
TTGAATATAAGACCCTACTTGCTATCTCCCCTGCACCAGCCAGGAGCCACCCATCCTCCAGC 
ACACTGAGCAGCAAGCTGGACACACGGCACACTGATCCAAATGGGTAAGGGGATGGTGGCGA 
TGCTCATTCTGGGTCTGCTACTTCTGGCGCTGCTCCTACCCGTGCAGGTTTCTTCATTTGTT 
CCTTTAACCAGTATGCCGGAAGCTACTGCAGCCGAAACCACAAAGCCCTCCAACAGTGCCCT 
ACAGCCTACAGCCGGTCTCCTTGTGGTCTTGCTTGCCCTTCTACATCTCTACCA TTAAG AGG 
CAGGTCAAGAAACAGCTACAGTTCTCCAACCCATACACTAAAACCGAATCCAAATGGTGCCT 
AGAAGTTCAATGTGGCAAGGAAAAAAACCAGGTCTTCATCAAATCTACTAATTTCACTCCTT 
ATTAACAGAGAAACGCTTGAGAGTCTCAAACTGGACTGGTTTAAAGAGCATCTGAAGGATTT 
GACTAGATGATAAATGCCTGTACTCCCAGTACTTTGGGAGGCCTAGGCCGGCGGATCACCTG 
AGGTCAGGAGTTTGAGACTAACCTGGCCAAAATGGTGAAACCCCATCTGTACTAAAAATACA 
AATATTGACTGGGCGTGGTGGTGAGTGCCTGTGATCCCAGCTACTCAGGTGGCTGAAGCAGG 
ACAATCACTTGAACTCAGGAGGCAGAGGTTGCAGTGAGCTGAGATCGCGCTACTGCACTCTA 
GCCTAGCCTGGGCAACAGAGTGAGACTTCGTCTCAAAAAAAAAAAAGCCAAGTGCAGTGGCT 
CACGCCTGTAATCCCGGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCAGGAGATCA 
AGACCATCCTGGCTAATACAGTGAAACCCTGTCTCTACTAAAAATACAAAAAATTAGCCGGG 
GATGGTGGCAGGCACCTGGAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATAGCGTGAA 
CTCAGGAGGCGGAGCTTGCAGTGAGCCGAGATTGCGCTACTGCACTCCAGCCTGGGCGACAG 
CGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 33 

></usr/seqdb2 /sst/DNA/Dnaseqs . min/ss . DNA4 8309 
xsubunit 1 of 1, 67 aa, 1 stop 
><MW: 6981, pi: 7.47, NX(S/T): 0 

MGKGMVAMLILGLLLLALLLPVQVSSFVPLTSMPEATAAETTKPSNSALQPTAGLLVVLLAL 
LHLYH 

Signal peptide: 

amino acids 15-27 



FIGURE 34 



GCCGCGGCGAGAGCGCGCCCAGCCCCGCCGCGMGCCCGCGCGCCCAGGACGCCTCCTCCCGCTGCTGGCCCGGC 
CGGCGGCCCTGACTGCGCTGCTGCTGCTGCTGCTGGGCCATGGCGGCGGCGGGCGCTGGGGCGCCCGGGCCCAGG 
AGGCGGCGGCGGCGGCGGCGGACGGGCCCCCCGCGGC7VGACGGCGAGGACGGACAGGACCCGCACAGCAAGCACC 
TGTAC^CGGCCGACATGTTCACGCACGGGATCCAGAGCGCCGCGCACTTCGTCATGTTCTTCGCGCCCTGGTGTG 
GACACTGCCAGCGGCTGCAGCCGACTTGGAATGACCTGGGAGACAA^ 

ATGTGGCTAAAGTGGACTGCACGGCCCACTCCGACGTGTGCTCCGCCCAGGGGGTGCGAGGATACCCCACCTTAA 
AGCTTTTCAAGCCAGGCCAAGAAGCTGTGAAGTACCAGGGTCCTCGGGACTTCCAGACACTGGAAAACTGGATGC 
TGCAGACACTGAACGAGGAGCC^GTGACACC^GAGCCGGAAGTGGAACCGCCCAGTGCCCCCGAGCTCAAGCaAG 
GGCTGTATGAGCTOTCAGCAAGCAACTTTGAGCTGCACGTTGCACAAGGCGACCACTTTATCAAGTTCTTCGCTC 
CGTGGTGTGGTCA.CTGCAAAGCCCTGGCTCCAACCTGGGAGCAGCTGGCTCTGGGCCTTGAACATTCCGAAACTG 
TCAAGATTGGCAAGGTTGATTGTACACAGCACTATGAACT 

TTCTCTGGTTCCGAGATGGGAAAAAGGTGGATCAGTACAAGGGAAAGCGGGATTTGGAGTCACTGAGGGAGTACG 
TGGAGTCGCAGCTGCAGCGCACAGAGACTGGAGCGACGGAGACCGTCACGCCCTCAGAGGCCCCGGTGCTGGCAG 
CTGAGCCCGAGGCTGACAAGGGCACTGTGTTGGCACTCACTGAAAATAACTTCGATGACACCATTGCAGAAGGAA 
TAACCTTCATCAAGTTTTATGCTCCATGGTGTGGTCATTGTAAGACTCTGGCTCCTACTTGGGAGGAACTCTCTA 
AAAAGGAATTCCCTGGTCTGGCGGGGGTCAAGATCGCCGAAGTAGACTGCACTGCTGAACGGAATATCTGCAGCA 
AGTATTCGGTACGAGGCTACCCCACGTTATTGCTTTTCCGAGGAGGGAAGAAAGTCAGTGAGCACAGTGGAGGCA 
GAGACCTTGACTCGTTACACCGCTTTGTCCTGAGC^ 

CTCTCCTGCCCAGCTCCCGGACCCTGCGTTTAGGAGTTCAGTCCCACAGAGGCCACTGGGTTCCCAGTGGTGGCT 
GTTCAGAAAGCAGAACATACTAAGCGTGAGGTATCTTC^ 

ATTCTTTATTAAGTTAAGTTTCTCTAAGTAAATGTGTAACTCATGGTCACTGTGTAAACATTTTCAGTGGCGATA 

TATCCCCTTTGACCTTOTCTTGATGAAATTTACATGGTTTCCTTTGAGACTAAAATAGCGTTGAGGGAAATGAAA 

TTGCTGGACTATTTGTGGCTCCTGAGTTGAGTGATTTTGGTGAAAGAAAGCACATCCAAAGCATAGTTTACCTGC 

CCACGAGTTCTGGAAAGGTGGCCTTGTGGCAGTATTGACGTTCCTCTGATCTTAAGGTCACAGTTGACTCAATAC 

TGTGTTGGTCCGTAGCATGGAGCAGATTGAAATGGRAAAACCCACACCTCTGGAAGATACCTTCACG^ 

TGGAGCTTCTGTTGCTGTGAATACTTCTCTCAGTGTGAGAGGTTAGCCGTGATGAAAGCAGCGTTACTTCTGACC 

GTGCCTGAGTAAGAGAATGCTGATGCCATAACTTTATGTGTCGATACTTGTCAAATCAGTTACTGTTCAGGGGAT 

CCTTCTGTTTCTC^CGGGGTGAAAGATGTCTC^ 

TGGATGTCTTCCTTAGAAAGGGTAGGCATGGAAAATTCO^CGAGGCTCATTCTCAGTATCTCATTAACTCATTGA 
AAGATTCCAGTTGTATTTGTCACCTGGGGTGACAAGACCAGACAGGCTTTCCCAGGCCTGGGTATCCAGGGAGGC 
TCTGCAGCCCTGCTGAAGGGCCCTAACTAGAGTTCTAGAGTTTCTGATTCTGTTTCTCAGTAGTCCTTTTAGAGG 
CTTGCTATACTTGGTOTGCTTCAAGGAGGTCGACCTTCTAATGTATGAAGAATGGGATGCATTTGATCTCAAGAC 
CAAAGACAGATGTCAGTGGGCTGCTCTGGCCCTGGTGTGCACGGCTGTGGCAGCTGTTGATGCCAGTGTCCTCTA 
ACTCATGCTGTCCTTGTGATTAAACACCTCTATCTCCCTTGGGAATAAGCACATACAGGCTTAAGCTCTAAGATA 
GATAGGTGTTTGTCCTTTTACCATCGAGCTACTTCCCAT^ 

CCCATACGCAAGGGGATGTGGATACTTGGCCCAAAGTAACTGGTGGTAGGAATCTTAGAAACAAGACCACTTATA 

CTGTCTGTCTGAGGCAGAAGATAACAGCAGC^TCTCGACCAGCCTCTGCCTTAAAGGAAATCTTTATTAATCACG 

TATGGTTCACAGATAATTCTTTTTTTAAAAAAACCCAACCTCCTAGAGAAGCACAACTGTCAAGAGTCTTGTACA 

CAC^CTTCAGCTTTGCATCACGAGTCTTGTATTCCAAGAAAATCAAAGTGGTAC^ 

GATACTTTCTAAATAAACTCTTTTTTTTTAA 



FIGURE 35 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA46776 
xsubunit 1 of 1, 432 aa, 1 stop 
><MW: 47629, pi: 5.90, NX(S/T): 0 

MPARPGRLLPLLARPAALTALLLLLLGHGGGGRWGARAQEAAAAAADGPPAADGEDGQDPHS 
KHLYTADMFTHGIQSAAHFVMFFAPWCGHCQRLQPT^DLGDKYNSMEDAKVYVAKVDCTAH 
SDVCSAQGVRGYPTLKLFKPGQEAVKYQGPRDFQTLENWMLQTLNEEPVTPEPEVEPPSAPE 
LKQGLYELSASNFELHVAQGDHFIKFFAPWCGHCKALAPTWEQLALGLEHSETVKIGKYDCT 
QHYELCSGNQVRGYPTLLWFRDGKKVDQYKGKRDLESLREYVESQLQRTETGATETVTPSEA 
PVLAAE PE ADKGTVLALTENNFDDT I AEGITF I KF YAPWCGHCKTLAPTWEEIiS KKEFPGLA 
GVKIAEVDCTAERNICSKYSVRGYPTLLLFRGGKKVSEHSGGRDLDSLHRFVLSQAKDEL 



Signal sequence: 

amino acids 1-32 



FIGURE 3 6 

CTTTTCTGAGGAACCACAGCaATGAATGGCTTTGCATCCTTGCTTCGAAGAAACCAATTTAT 
CCTCCTGGTACTATTTCTTTTGCAAATTCAGAGTCTGGGTCTGGATATTGATAGCCGTCCTA 
CCGCTGAAGTCTGTGCCACACACACAATTTCACCAGGACCCAAAGGAGATGATGGTGAAAAA 
GGAGATCCAGGAGAAGAGGGAAAGCATGGCAAAGTGGGACGCATGGGGCCGAAAGGAATTAA 
AGGAGAACTGGGTGATATGGGAGATCAGGGCAATATTGGCAAGACTGGGCCCATTGGGAAGA 
AGGGTGACAAAGGGGAAAAAGGTTTGCTTGGAATACCTGGAGAAAAAGGCAAAGCAGGTACT 
GTCTGTGATTGTGGAAGATACCGGAAATTTGTTGGACAACTGGATATTAGTATTGCTCGGCT 
CAAGACATCTATGAAGTTTGTCAAGAATGTGATAGCAGGGATTAGGGAAACTGAAGAGAAAT 
TCTACTACATCGTGCAGGAAGAGAAGAACTACAGGGAATCCCTAACCCACTGCAGGATTCGG 
GGTGGAATGCTAGCCATGCCCAAGGATGAAGCTGCCAACACACTCATCGCTGACTATGTTGC 
CAAGAGTGGCTTCTTTCGGGTGTTCATTGGCGTGAATGACCTTGAAAGGGAGGGACAGTACA 
TGTCCACAGACAACACTCCACTGCAGAACTATAGCAACTGGAATGAGGGGGAACCCAGCGAC 
CCCTATGGTCATGAGGACTGTGTGGAGATGCTGAGCTCTGGCAGATGGAATGACACAGAGTG 
CCATCTTACCATGTACTTTGTCTGTGAGTTCATCAAGAAGAAAAA GTAA CTTCCCTCATCCT 



ATTGTACTACATTTGATCTGAGTCAACATAGCTAGAAAATGCTAAACTGAGGTATGGAGCCT 
CCATCATCAAAAAAAAAAAAAAAA 



FIGURE 37 

></usr/seqdb2/sst/DNA/Dnasegs.min/ss.DNA50980 
xsubunit 1 of 1, 277 aa, 1 stop 
><MW: 30645, pi: 7.47, NX(S/T): 2 

MNGFASLLRRNQFILLVLFLLQIQSLGLDIDSRPTAEVCATHTISPGPKGDDGEKGDPGEEG 
KHGKVGRMGPKGIKGELGDMGDQGNIGKTGPIGIOCGDKGEKGLLGIPGEKGKAGTVCDCGRY 
RKFVGQLDISIARLKTSMKFVKWIAGIRETEEKFYYIVQEEKNYRESLTHCRIRGGMLAMP 
KDEAANTLIADYVAKSGFFRVFIGVNDLEREGQYMSTDNTPLQNYSNWNEGEPSDPYGHEDC 
VEMLSSGRWNDTECHLTMYFVCEFIKKKK 

Signal peptide: 

amino acids 1-25 



FIGURE 3 8 

GGTTCTATCGATTCGAATTCGGCCACACTGGCCGGATCCTCTAGAGATCCCTCGACCTCGAC 
CCACGCGTCCGCTGCTCTCCGCCCGTGTGGAGTGGTGGGGGCCTGGGTGGGAATGGGCGTGT 
GCCAGCGCACGCGCGCTCCCTGGAAGGAGAAGTCTCAGCTAGAACGAGCGGCCCTAGGTTTT 
CGGAAGGGAGGATCAGGGATGTTTGCGAGCGGCTGGAACCAGACGGTGCCGATAGAGGAAGC 
GGGCTCCATGGCTGCCCTCCTGCTGCTGCCCCTGCTGCTGTTGCTACCGCTGCTGCTGCTGA 
AGCTACACCTCTGGCCGCAGTTGCGCTGGCTTCCGGCGGACTTGGCCTTTGCGGTGCGAGCT 
CTGTGCTGCAAAAGGGCTCTTCGAGCTCGCGCCCTGGCCGCGGCTGCCGCCGACCCGGAAGG 
TCCCGAGGGGGGCTGCAGCCTGGCCTGGCGCCTCGCGGAACTGGCCCAGCAGCGCGCCGCGC 
ACACCTTTCTCATTCACGGCTCGCGGCGCTTTAGCTACTCAGAGGCGGAGCGCGAGAGTAAC 
AGGGCTGCACGCGCCTTCCTACGTGCGCTAGGCTGGGACTGGGGACCCGACGGCGGCGACAG 
CGGCGAGGGGAGCGCTGGAGAAGGCGAGCGGGCAGCGCCGGGAGCCGGAGATGCAGCGGCCG 
GAAGCGGCGCGGAGTTTGCCGGAGGGGACGGTGCCGCCAGAGGTGGAGGAGCCGCCGCCCCT 
CTGTCACCTGGAGCAACTGTGGCGCTGCTCCTCCCCGCTGGCCCAGAGTTTCTGTGGCTCTG 
GTTCGGGCTGGCCAAGGCCGGCCTGCGCACTGCCTTTGTGCCCACCGCCCTGCGCCGGGGCC 
CCCTGCTGCACTGCCTCCGCAGCTGCGGCGCGCGCGCGCTGGTGCTGGCGCCAGAGTTTCTG 
GAGTCCCTGGAGCCGGACCTGCCCGCCCTGAGAGCCATGGGGCTCCACCTGTGGGCTGCAGG 
CCCAGGAACCCACCCTGCTGGAATTAGCGATTTGCTGGCTGAAGTGTCCGCTGAAGTGGATG 
GGCCAGTGCCAGGATACCTCTCTTCCCCCCAGAGCATAACAGACACGTGCCTGTACATCTTC 
ACCTCTGGCACCACGGGCCTCCCCAAGGCTGCTCGGATCAGTCATCTGAAGATCCTGCAATG 
CCAGGGCTTCTATCAGCTGTGTGGTGTCCACCAGGAAGATGTGATCTACCTCGCCCTCCCAC 
TCTACCACATGTCCGGTTCCCTGCTGGGCATCGTGGGCTGCATGGGCATTGGGGCCACAGTG 
GTGCTGAAATCCAAGTTCTCGGCTGGTCAGTTCTGGGAAGATTGCCAGCAGCACAGGGTGAC 
GGTGTTCCAGTACATTGGGGAGCTGTGCCGATACCTTGTCAACCAGCCCCCGAGCAAGGCAG 
AACGTGGCCATAAGGTCCGGCTGGCAGTGGGCAGCGGGCTGCGCCCAGATACCTGGGAGCGT 
TTTGTGCGGCGCTTCGGGCCCCTGCAGGTGCTGGAGACATATGGACTGACAGAGGGCAACGT 
GGCCACCATCAACTACACAGGACAGCGGGGCGCTGTGGGGCGTGCTTCCTGGCTTTACAAGC 
ATATCTTCCCCTTCTCCTTGATTCGCTATGATGTCACCACAGGAGAGCCAATTCGGGACCCC 
CAGGGGCACTGTATGGCCACATCTCCAGGTGAGCCAGGGCTGCTGGTGGCCCCGGTAAGCCA 
GCAGTCCCCATTCCTGGGCTATGCTGGCGGGCCAGAGCTGGCCCAGGGGAAGTTGCTAAAGG 
ATGTCTTCCGGCCTGGGGATGTTTTCTTCAACACTGGGGACCTGCTGGTCTGCGATGACCAA 
GGTTTTCTCCGCTTCCATGATCGTACTGGAGACACCTTCAGGTGGAAGGGGGAGAATGTGGC 
CACAACCGAGGTGGCAGAGGTCTTCGAGGCCCTAGATTTTCTTCAGGAGGTGAACGTCTATG 
GAGTCACTGTGCCAGGGCATGAAGGCAGGGCTGGAATGGCAGCCCTAGTTCTGCGTCCCCCC 
CACGCTTTGGACCTTATGCAGCTCTACACCCACGTGTCTGAGAACTTGCCACCTTATGCCCG 
GCCCCGATTCCTCAGGCTCCAGGAGTCTTTGGCCACCACAGAGACCTTCAAACAGCAGAAAG 
TTCGGATGGCAAATGAGGGCTTCGACCCCAGCACCCTGTCTGACCCACTGTACGTTCTGGAC 
CAGGCTGTAGGTGCCTACCTGCCCCTCACAACTGCCCGGTACAGCGCCCTCCTGGCAGGAAA 
CCTTCGAATCTGAGAACTTCCACACCTGAGGCACCTGAGAGAGGAACTCTGTGGGGTGGGGG 
CCGTTGCAGGTGTACTGGGCTGTCAGGGATCTTTTCTATACCAGAACTGCGGTCACTATTTT 
GTAATAAATGTGGCTGGAGCTGATCCAGCTGTCTCTGACCTAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAGGGCGGCCGCGACTCTAGAGTCGACCTGCAGTAGGGATAACAGGGTAATAAGC 
TTGGCCGCCATGGCCCAACTTGTTTATTGCAG 



FIGURE 3 9 

></usr/seqd±>2/sst/DNA/Dnaseqs.min/ss.DNA50913 
xsubunit 1 of 1, 730 aa, 1 stop 
><MW: 78644, pi: 7.65, NX(S/T): 2 

MGVCQRTRAPWKEKSQLERAALGFRKGGSGMFASGWNQTVPIEEAGSMAALLLLPLLLLLPL 
LLLKLHLWPQLRWLPADIAFAVRALCCKRALRA 

RAAHTFLIHGSRRFSYSEAERESNRAARAFLRALGWDWGPDGGDSGEGSAGEGERAAPGAGD 
AAAGSGAEFAGGDGAARGGGAAAPLSPGATVALLLPAGPEFLWLWFGLAKAGLRTAFVPTAL 
RRGPLLHCLRSCGARALVLAPEFLESLEPDLPALRAMGLHLWAAGPGTHPAGISDLLAEVSA 
EVDGPVPGYLSS PQS I TDTCLY I FTSGTTGLPKAARI SHLKI LQCQGFYQLCGVHQEDVI YL 
ALPLYHMSGSLLGIVGCMGIGATWLKSKFSAGQFWEDCQQHRVTVFQYIGELCRYLVNQPP 
SKAERGHKVRLAVGSGLRPDTWERFWRFGPLQVLETYGLTEGNVATINYTGQRGAVGRASW 
LYKHI FPFSL I RYDVTTGE P I RD PQGHCMATS PGEPGLLVAPVSQQS PFLGYAGGPELAQGK 
LLKDVFRPGDVFFNTGDLLVCDDQGFLRFHDRTGDTFRWKGENVATTEVAEVFEALDFLQEV 
NVYGVTVPGHEGRAGMAALVLRPPHALDLMQLYTHVSENLPPYARPRFLRLQESLATTETFK 
QQKVRMANEGFDPSTLSDPLYVLDQAVGAYLPLTTARYSALLAGNLRI 

Type II transmembrane domain: 

amino acids 45-65 

Other transmembrane domain : 

amino acids 379-398 

cAMP- and cGMP- dependent protein kinase phosphorylation site 

starting at amino acid 136 

CUB domain protein motif 

amino acids 254-261 

putative AMP-binding domain siganture 

amino acids 332-343 



N-glycosylation sites 

amino acids 37-40 and 483-486 



FIGURE 40 

CCTGTGTTAAGCTGAGGTTTCCCCTAGATCTCGTATATCCCCAACACATACCTCCACGCACA 
CACATCCCCAAGAACCTCGAGCTCACACCAACAGACACACGCGCGCATACACACTCGCTCTC 
GCTTGTCCATCTCCCTCCCGGGGGAGCCGGCGCGCGCTCCCACCTTTGCCGCACACTCCGGC 
GAGCCGAGCCCGCAGCGCTCCAGGATTCTGCGGCTCGGAACTCGGATTGCAGCTCTGAACCC 
CCATGGTGGTTTTTTAAACACTTCTTTTCCTTCTCTTCCTCGTTTTGATTGCACCGTTTCCA 
TCTGGGGGCTAGAGGAGCAAGGCAGCAGCCTTCCCAGCCAGCCCTTGTTGGCTTGCCATCGT 
CCATCTGGCTTATAAAAGTTTGCTGAGCGCAGTCCAGAGGGCTGCGCTGCTCGTCCCCTCGG 
CTGGCAGAAGGGGGTGACGCTGGGCAGCGGCGAGGAGCGCGCCGCTGCCTCTGGCGGGCTTT 
CGGCTTGAGGGGCAAGGTGAAGAGCGCACCGGCCGTGGGGTTTACCGAGCTGGATTTGTATG 
TTGCACCATGCCTTCTTGGATCGGGGCTGTGATTCTTCCCCTCTTGGGGCTGCTGCTCTCCC 
TCCCCGCCGGGGCGGATGTGAAGGCTCGGAGCTGCGGAGAGGTCCGCCAGGCGTACGGTGCC 
AAGGGATTCAGCCTGGCGGACATCCCCTACCAGGAGATCGCAGGGGAACACTTAAGAATCTG 
TCCTCAGGAATATACATGCTGCACCACAGAAATGGAAGACAAGTTAAGCCAACAAAGCAAAC 
TCGAATTTGAAAACCTTGTGGAAGAGACAAGCCATTTTGTGCGCACCACTTTTGTGTCCAGG 
CATAAGAAATTTGACGAATTTTTCCGAGAGCTCCTGGAGAATGCAGAAAAGTCACTAAATGA 
TATGTTTGTACGGACCTATGGCATGCTGTACATGCAGAATTCAGAAGTCTTCCAGGACCTCT 
TCACAGAGCTGAAAAGGTACTACACTGGGGGTAATGTGAATCTGGAGGAAATGCTCAATGAC 
TTTTGGGCTCGGCTCCTGGAACGGATGTTTCAGCTGATAAACCCTCAGTATCACTTCAGTGA 
AGACTACCTGGAATGTGTGAGCAAATACACTGACCAGCTCAAGCCATTTGGAGACGTGCCCC 
GGAAACTGAAGATTCAGGTTACCCGCGCCTTCATTGCTGCCAGGACCTTTGTCCAGGGGCTG 
ACTGTGGGCAGAGAAGTTGCAAACCGAGTTTCCAAGGTCAGCCCAACCCCAGGGTGTATCCG 
TGCCCTCATGAAGATGCTGTACTGCCCATACTGTCGGGGGCTTCCCACTGTGAGGCCCTGCA 
ACAACTACTGTCTCAACGTCATGAAGGGCTGCTTGGCAAATCAGGCTGACCTCGACACAGAG 
TGGAATCTGTTTATAGATGCAATGCTCTTGGTGGCAGAGCGACTGGAGGGGCCATTCAACAT 
TGAGTCGGTCATGGACCCGATAGATGTCAAGATTTCTGAAGCCATTATGAACATGCAAGAAA 
ACAGCATGCAGGTGTCTGCAAAGGTCTTTCAGGGATGTGGTCAGCCCAAACCTGCTCCAGCC 
CTCAGATCTGCCCGCTCAGCTCCTGAAAATTTTAATACACGTTTCAGGCCCTACAATCCTGA 
GGAAAGACCAACAACTGCTGCAGGCACAAGCTTGGACCGGCTGGTCACAGACATAAAAGAGA 
AATTGAAGCTCTCTAAAAAGGTCTGGTCAGCATTACCCTACACTATCTGCAAGGACGAGAGC 
GTGACAGCGGGCACGTCCAACGAGGAGGAATGCTGGAACGGGCACAGCAAAGCCAGATACTT 
GCCTGAGATCATGAATGATGGGCTCACCAACCAGATCAACAATCCCGAGGTGGATGTGGACA 
TCACTCGGCCTGACACTTTCATCAGACAGCAGATTATGGCTCTCCGTGTGATGACCAACAAA 
CTAAAAAACGCCTACAATGGCAATGATGTCAATTTCCAGGACACAAGTGATGAATCCAGTGG 
CTCAGGGAGTGGCAGTGGGTGCATGGATGACGTGTGTCCCACGGAGTTTGAGTTTGTCACCA 
CAGAGGCCCCCGCAGTGGATCCCGACCGGAGAGAGGTGGACTCTTCTGCAGCCCAGCGTGGC 
CACTCCCTGCTCTCCTGGTCTCTCACCTGCATTGTCCTGGCACTGCAGAGACTGTGCAGATA 
ATCTTGGGTTTTTGGTCAGATGAAACTGCATTTTAGCTATCTGAATGGCCAACTCACTTCTT 
TTCTTACACTCTTGGACAATGGACCATGCCACAAAAACTTACCGTTTTCTATGAGAAGAGAG 
CAGTAATGCAATCTGCCTCCCTTTTTGTTTTCCCAAAGAGTACCGGGTGCCAGACTGAACTG 
CTTCCTCTTTCCTTCAGCTATCTGTGGGGACCTTGTTTATTCTAGAGAGAATTCTTACTCAA 
ATTTTTCGTACCAGGAGATTTTCTTACCTTCATTTGCTTTTATGCTGCAGAAGTAAAGGAAT 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA50914 
xsubunit 1 of 1, 555 aa, 1 stop 
><MW: 62736, pi: 5.36, NX(S/T): 0 

MPS WI GAVI LPLLGLLL S L PAGADVKARS CGE VRQAYGAKGFSLAD I PYQE I AGEHLRI CPQ 
EYTCCTTEMEDKLSQQSKLEFENLVEETSHFVRTTFVSRHKKFDEFFRELLENAEKSLNDMF 
VRTYGMLYMQNSEVFQDLFTELKRYYTGGNVNLEEMLNDFWARLLERMFQLINPQYHFSEDY 
LECVSKYTDQLKPFGDVPRKLKI QVTRAF I AARTFVQGLTVGREVANRVS KVS PTPGC IRAL 
MKMLYCPYCRGLPTVRPCNI^CLNVMKGCLANQADLJDTEWNIiFIDAM 

VMDPIDVKISEAIMNMQENSMQVSAKVFQGCGQPKPAPALRSARSAPENFNTRFRPYNPEER 
PTTAAGTSLDRLVTDIKEKLKLSKKVWSALPYTICKDESVTAGTSNEEECWNGHSKARYLPE 
IMNDGLTNQ INNPE VDVD I TRPDTF I RQQ IMALRVMTNKLKNAYNGNDVNFQDTSDES SGSG 
SGSGCMDDVCPTEFEFVTTEAPAVDPDRREVDSSAAQRGHSLLSWSLTCIVLALQRDCR 

Signal peptide: 

amino acids 1-23 
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CGGACGCX3T®Ma3GAa3C^ 
CAGTTTGCAGCGCCTGCGCCGGGTGCGCC^^ 

TAGGGACCCGGCTTTGGCCTTCAGGCTCCCTAGCAGCGGGGAAAAGGAATTGCTGCCCGGAGTTTCTGCGGAGGT 
GGAGGGAGATCAGGAAACGGCTTCTTCCTCACTTCGCCGCCTGGTGAGTGTCGGGGAGATTGGCAAACGCCTAGG 
AAAGGACTGGGGAAAATAGCCCTGGGAAAGTGGAGAAGGTGATCAGGAGGCCGGTCCACTACGGCAGTTTATCTG 
TCTGATCAGAGCCAGACGCGACGCGTCCACTTCGCAGTTCTTTCCAGGTGTGGGGACCGCAGGACAGACGGCCGA 
TCCCGCCGCCCTCCGTACCAGCACTCCC^GGAGAGTCAGCCTCGCTCCCCAACGTCGAGGGCGCTCTGGCCACGA 
AAAGTTCCTGTCCACTGTGATTCTCAATTCCTTGCTTGGTTTTTTTCTCCAGAGAACTTTTGGGTGGAGATATTA 

TGGAGAAGAGCGAGCCCTCCTTGTTCTTCCGGAGTCCCATCCATTAAGCCATCACTTCTGGAAGATTAAAGTTGT 

CGGACATGGTGACAGCTGAGAGGAGAGGAGGATTTCTTGCCAGGTGGAGAGTCTTCACCGTCTGTTGGGTGCATG 

TGTGCGCCCGCAGCGGCGCGGGGCGCGTGGTTCTCCGCGTGGAGTOTCACCTGGGACCTGAGTGAATGGCTCCCA 

GGGGCTGTGCGGGGCaTCCGCCTCCGCCTTCTCCACAGGCCTGTGTCTGTCCTGGAAAGATGCTAGCAATGGG^ 

CGCTGGCAGGATTCTGGATCCTCTGCCTCCTCACTTATGGTTACCTGTCCTGGGGCCAGGCCTTAGAAGAGGAGG 

AAGAAGGGGCCTTACTAGCTC^GCTGGAGAGAAACTAGAGCCC^GCACAACTTCC^CCTCCCAGCCCCATCTCA 

TTTTCATCCTAGCGGATGATCAGGGATTTAGAGATGTGGGTTACCACGGATCTGAGATTAAAAGACCTACTCTTG 

AC^GCTCGCTGCCGAAGGAGTTAAACTGGAGAACTACTATGTCCAGCCTATTTGCACACC^TCCAGGAGTCAGT 

TTATTACTGGAAAGTATC^GATACACACCGGACTTCAACATTCTATCATAAGACCTACCCA^ 

CTCTGGACAATGCCACCCTACCTCAGAAACTGAAGGAGGTTGGATATTCAACGCATATGGTCGGAAAATGGCACT 

TGGGTTTTAACAGAAAAGAATGCATGCCCACCAGAAGAGGATTTGATACCTTTTTTGGTTCCOTTTTGGGAAGTG 

GGGATTACTATACAC^CTACAAATGTGACAGTCCTGGGATGTGTGGCTATGACTTGTATGAAAACGAC^ 

CCTGGGACTATGACAATGGC^TATACTCCA'CACAGATGTACACTCAGAGAGTACAGCAAATCTTA 

ACCCCACAAAGCCTATATTTTTATATACTGCCTATCAAGCTGTTCATTCACCACTGCAAGCTCCTGGCAGGTATT 

TOGAACACEKXX»TCC»^ 

TCAACAACGTGAC^TTGGCTCTAAAGACTTATGGTTTCTATAAC^ 

GTGGCGAGCCTACGGC^GGAGGGAGTAACT(MCCTCTCAGAGGTAGCAAAGGAACATATTGGGAAGGAGGGATCC 

GGGCTGTAGGCTTTGTGCATAGCCCACTTCTGAAAAACAAGGGAACAGTGTGTAAGGAACTTGTGCACAT^ 

ACTGGTACCCCaCTCTCATTTC^CTGGCTGAAGGACAGATTGATGAGGACATTCAACTAGATGGCTATGATATC^ 

GGGAGACCATAAGTGAGGGTCTTCGCTCACCCCGAGTAGATATTTTGCATAACATTGACCCCTATACACCAAGGC 

AAAAAATGKSCTCCTGGGCAGCAGGCTATGGGATC^ 

GAAATTGCTTACAGGAAATCCTGGCTACAGCGACTGGGTCCCCCCTCAGTCTTTC^GCAACCTGGGACCGAACCG 

GTGGCACAATGAACGGATCACCTTGTCAACTGGCAAAAGTGTATGGCTTTTCAACATCACAGCCGACCCAT^ 

GAGGGTGGACCTATCTAAC^GGTATCCAGGAATCGTGAAGAAGCTCCTACGGAGGCTCTCACAGTTCAAC^ 

TGCAGTGCCGGTCAGGTATCCCCCCAAAGACCCCAGAAGTAACCCTAGGCTCAATGGAGGGGTCTGGGGACCATG 

GTATAAAGAGGAAACCAAGAAAAAGAAGCCAAGCAAAAATCAGGCTGAGAAAAAGCAAAAGAAAA 

GAAGAAGAAAC!AGCAGAAAGCAGTCTCAGGTAAACCAGCAAATTTGGCTCGATAATATCGCTGGCCTAAGCGTCA 

GGCTTGTTTTCATGCTGTGCCACTCCAGAGACTTCTGCCACCTGGCCGCCACACTGAAAACTGTCCTGCTCAGTG 

CC^AGGTGCTACTCTTGCAAGCCACACTTAGAGAGAGTGGAGATGTTTATTTCTCTCGCTCCTTTAGAAAACGTG 

GTGAGTCCTGAGTTCCACTGCTGTGCTTCAGTCAACTGACCAAACACTGCTTTGAATTATAGGAGGAGAACAATA 

ACCTACCATCCGCAAGCATGCTAATTTGATGGAAGTTACAGGGTAGCATGATTAAAACTACCTTTGAT^ 



FIGURE 42B 



AGTa^GATTGTGTCmCCTO^GGCCTTQ^GAATATATTTTCTTGGTGAATTTTTGTATGTCTGTCATATGA 
CACTTGGGTTTTTTAATTAATTCTATTTTATATATATAAATATATGTTTCTTTTCCTGTGAAAAGCTGTTTTTCT 
CACATGTGAACAGCTTGCACCTCATTTTACCATGCGTG^ 

ACAATGAATGTAACTATTTTCTAAAC^CTTTACTAGAAGAACATTTCAGTATAAAAAACCTAATTTATTTTTACA 
GAAAAATATTTTGTTGTTTTTATAAAAAGTTATGCAAATGACTTO 
ATTTTATTTCATTTCTTCAAATTATCAAGCACTGTAATACT 
TAAAAAACATCATTCAGAAAACTTTATAATCGTCATTC 

ATTACTTGGAAATTCAATGTTTGTGCAGAGTTGAGACAACTTTATTGTTTCTATCATAAACTATTTATGTATCTT 
AATTATTAAAATGATTTACTTTATGGCACTAGAAAATTTACTGTGGCTTTTCTGATCTAACTTCTAGCTAAAATT 
GTATCATTGGTCCTAAAAAATAAAAATCTTTACTAATAGGC^TTGAAGGAATGGTTTGCTAAC^CC^CAGTAA 
TATAATATGATTTTACAGATAGATGCTTCCCCTTGG 

TATTTATATTAGGTTGGTGCAAAACTAGTTGCGGTTTTTCCCATTAAaAGTAATAACCTTACTCTTATACAAAGT 
GGAC^CTGTGGGGAGATACAGAGAAATGGAAGATACGGATCCTGCCTGGAGTAGGTAACCTTGCTTGGAAACCCC 
ACATGCAAACGTCATGAGGAGAATTAAAGGAGTATTATCAGTAATGAAGTTTATCATGGGTCATCAATGAGCATA 
GATTGGTGTGGATCCTGTAGACCCTGGTGTTTTCTTTGAAGTGCCCTCTCCTAATGCAGAGGCCTTGAAGCTTAC 
AGTATACACTTGAAAAGTCACAGATAGCTAGAATTATGATCTTTGAAGTTATAACTGTGATCTGAAAATGTGTGT 
GGTGGTATGACAGCATACCATTAAATACATTTACATCACAGCTCAAAGGACTGTGATATAATCCATTTATATCAC 
AACTC^AAGGACTGTGATATAATCCATTTATATCACAGCT 

CTAGTACTGAAATTACTAAATTGGGTAAGATGATTTAAATGATTTTAATTTTAACATTTTATTTCTAGAATATAT 
GGCTCCATTTTATTTTATAGTGTAAAGTTGTATTTCCTAAAGTTTGTGTTTTGTCGACAGTATCTTTTAAATGAG 
TCTTAAAAATAAAGGC^TATTGTTC^TGTTTAAAAAAAAAAAAAAftAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48296 
xsubunit 1 of 1, 515 aa, 1 stop 
><MW: 56885, pi: 6.49, NX(S/T): 5 

MAPRGCAGHPPPPSPQACVCPGKMLAMGALAGFWILCLLTYGYLSWGQALEEEEEGALLAQA 
GEKLEPSTTSTSQPHLIFILADDQGFRDVGYHGSEIKTPTLDKLAAEGVKLENYYVQPICTP 
SRSQFITGKYQIHTGLQHS I IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKEC 
MPTRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 
I LASHNPTKP I FLYTAYQAVHSPLQAPGRYFEHYRS I ININRRRYAAMLSCLDEAINNVTLA 
LKTYGFYNNS 1 1 1 YS SDNGGQPTAGGSNWPLRGS KGTYWEGG I RAVGFVHS PLLKNKGTVCK 
E LVH I TDWYPTL I S LAEGQIDED I QLDGYD I WET I S EGLRS PRVD I LHN IDP YTPRQKMAPG 
QQAMGSGTLQSSQPSECSTGNCLQEILATATGSPLSLSATWDRTGGTMNGSPCQLAKVYGFS 
TSQPTHMRGWTYLTGIQES 

Important Features : 
Signal Peptide: 

amino acids 1-37 

Sulfatases signature 1. 

amino acids 120-132 

Sulfatases signature 2 . 

amino acids 168-177 

Tyrosine kinase phosphorylation site. 

amino acids 163-169 

N-glycosylation sites. 

amino acids 157-160, 306-309 and 318-321 
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CGGACGCGTGGGTGCGAGTGGAGCGGAGGACCCGAGCGGCTGAGGAGAGAGGAGGCGGCGGC 
TTAGCTGCTACGGGGTCCGGCCGGCGCCCTCCCGAGGGGGGCTCAGGAGGAGGAAGGAGGAC 
CCGTGCGAGAATGCCTCTGCCCTGGAGCCTTGCGCTCCCGCTGCTGCTCTCCTGGGTGGCAG 
GTGGTTTCGGGAACGCGGCCAGTGCAAGGCATCACGGGTTGTTAGCATCGGCACGTCAGCCT 
GGGGTCTGTCACTATGGAACTAAACTGGCCTGCTGCTACGGCTGGAGAAGAAACAGCAAGGG 
AGTCTGTGAAGCTACATGCGAACCTGGATGTAAGTTTGGTGAGTGCGTGGGACCAAACAAAT 
GCAGATGCTTTCCAGGATACACCGGGAAAACCTGCAGTCAAGATGTGAATGAGTGTGGAATG 
AAACCCCGGCCATGCCAACACAGATGTGTGAATACACACGGAAGCTACAAGTGCTTTTGCCT 
CAGTGGCCACATGCTCATGCCAGATGCTACGTGTGTGAACTCTAGGACATGTGCCATGATAA 
ACTGTCAGTACAGCTGTGAAGACACAGAAGAAGGGCCACAGTGCCTGTGTCCATCCTCAGGA 
CTCCGCCTGGCCCCAAATGGAAGAGACTGTCTAGATATTGATGAATGTGCCTCTGGTAAAGT 
CATCTGTCCCTACAATCGAAGATGTGTGAACACATTTGGAAGCTACTACTGCAAATGTCACA 
TTGGTTTCGAACTGCAATATATCAGTGGACGATATGACTGTATAGATATAAATGAATGTACT 
ATGGATAGCCATACGTGCAGCCACCATGCCAATTGCTTCAATACCCAAGGGTCCTTCAAGTG 
TAAATGCAAGCAGGGATATAAAGGCAATGGACTTCGGTGTTCTGCTATCCCTGAAAATTCTG 
TGAAGGAAGTCCTCAGAGCACCTGGTACCATCAAAGACAGAATCAAGAAGTTGCTTGCTCAC 
AAAAACAGCATGAAAAAGAAGGCAAAAATTAAAAATGTTACCCCAGAACCCACCAGGACTCC 
TACCCCTAAGGTGAACTTGCAGCCCTTCAACTATGAAGAGATAGTTTCCAGAGGCGGGAACT 
CTCATGGAGGTAAAAAAGGGAATGAAGAGAAATGAAAGAGGGGCTTGAGGATGAGAAAAGAG 
AAGAGAAAGCCCTGAAGAATGACATAGAGGAGCGAAGCCTGCGAGGAGATGTGTTTTTCCCT 
AAGGTGAATGAAGCAGGTGAATTCGGCCTGATTCTGGTCCAAAGGAAAGCGCTAACTTCCAA 
ACTGGAACATAAAGATTTAAATATCTCGGTTGACTGCAGCTTCAATCATGGGATCTGTGACT 
GGAAACAGGATAGAGAAGATGATTTTGACTGGAATCCTGCTGATCGAGATAATGCTATTGGC 
TTCTATATGGCAGTTCCGGCCTTGGCAGGTCACAAGAAAGACATTGGCCGATTGAAACTTCT 
CCTACCTGACCTGCAACCCCAAAGCAACTTCTGTTTGCTCTTTGATTACCGGCTGGCCGGAG 
ACAAAGTCGGGAAACTTCGAGTGTTTGTGAAAAACAGTAACAATGCCCTGGCATGGGAGAAG 
ACCACGAGTGAGGATGAAAAGTGGAAGACAGGGAAAATTCAGTTGTATCAAGGAACTGATGC 
TACCAAAAGCATCATTTTTGAAGCAGAACGTGGCAAGGGCAAAACCGGCGAAATCGCAGTGG 
ATGGCGTCTTGCTTGTTTCAGGCTTATGTCCAGATAGCCTTTTATCTGTGGATGACTGAATG 
TTACTATCTTTATATTTGACTTTGTATGTCAGTTCCCTGGTTTTTTTGATATTGCATCATAG 
GACCTCTGGCATTTTAGAATTACTAGCTGAAAAATTGTAATGTACCAACAGAAATATTATTG 
TAAGATGCCTTTCTTGTATAAGATATGCCAATATTTGCTTTAAATATCATATCACTGTATCT 
TCTCAGTCATTTCTGAATCTTTCCNCATTATATTATAAAATNTGGAAANGTCAGTTTATCTC 
CCCTCCTCNGTATATCTGATTTGTATANGTANGTTGATGNGCTTCTCTCTACAACATTTCTA 
GAAAATAGAAAAAAAAGCACAGAGAAATGTTTAACTGTTTGACTCTTATGATACTTCTTGGA 
AACTATGACATCAAAGATAGACTTTTGCCTAAGTGGCTTAGCTGGGTCTTTCATAGCCAAAC 
TTGTATATTTAATTCTTTGTAATAATAA 
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MPLPWSIALPLLLSWVAGGFGNAASARHHGLLASARQPGVCHYGTKLACCYGWRRNSKGVCE 
ATCEPGCKFGECVGPNKCRCFPGYTGKTCSQDVNECGMKPRPCQHRCVNTHGSYKCFCLSGH 
MLMPDATCVNSRTCAMINCQYSCEDTEEGPQCLCPSSGLRLAPNGRDCLDIDECASGKVICP 
YNRRCVNTFGSYYCKCHIGFELQYISGRYDCIDINECTMDSHTCSHHANCFNTQGSFKCKCK 
QGYKGNGLRCSAIPENSVKEVLRAPGTIKDRIKKLLAHKNSMKECKAKIKNVTPEPTRTPTPK 
VNLQPFNYEE I VSRGGNSHGGKKGNEEK 

Signal peptide: 
amino acids 1-21 

EGF-like domain cysteine pattern signature. 

amino acids 8 0-91 

Calcium-binding EGF-like domains 

amino acids 103-124, 230-251 and 185-206 
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GGGAGCTGCTGCTGTGGCTGCTGGTGCTGTGCGCGCTGCTCCTGCTCTTGGTGCAGCTGCTG 
CGCTTCCTGAGGGCTGACGGCGACCTGACGCTACTATGGGCCGAGTGGCAGGGACGACGCCC 
AGAATGGGAGCTGACTGATATGGTGGTGTGGGTGACTGGAGCCTCGAGTGGAATTGGTGAGG 
AGCTGGCTTACCAGTTGTCTAAACTAGGAGTTTCTCTTGTGCTGTCAGCCAGAAGAGTGCAT 
GAGCTGGAAAGGGTGAAAAGAAGATGCCTAGAGAATGGCAATTTAAAAGAAAAAGATATACT 
TGTTTTGCCCCTTGACCTGACCGACACTGGTTCCCATGAAGCGGCTACCAAAGCTGTTCTCC 
AGGAGTTTGGTAGAATCGACATTCTGGTCAACAATGGTGGAATGTCCCAGCGTTCTCTGTGC 
ATGGATACCAGCTTGGATGTCTACAGAAAGCTAATAGAGCTTAACTACTTAGGGACGGTGTC 
CTTGACAAAATGTGTTCTGCCTCACATGATCGAGAGGAAGCAAGGAAAGATTGTTACTGTGA 
ATAGCATCCTGGGTATCATATCTGTACCTCTTTCCATTGGATACTGTGCTAGCAAGCATGCT 
CTCCGGGGTTTTTTTAATGGCCTTCGAACAGAACTTGCCACATACCCAGGTATAATAGTTTC 
TAACATTTGCCCAGGACCTGTGCAATCAAATATTGTGGAGAATTCCCTAGCTGGAGAAGTCA 
CAAAGACTATAGGCAATAATGGAGACCAGTCCCACAAGATGACAACCAGTCGTTGTGTGCGG 
CTGATGTTAATCAGCATGGCCAATGATTTGAAAGAAGTTTGGATCTCAGAACAACCTTTCTT 
GTTAGTAACATATTTGTGGCAATACATGCCAACCTGGGCCTGGTGGATAACCAACAAGATGG 
GGAAGAAAAGGATTGAGAACTTTAAGAGTGGTGTGGATGCAGACTCTTCTTATTTTAAAATC 
TTTAAGACAAAACATGACTGAAAAGAGCACCTGTACTTTTCAAGCCACTGGAGGGAGAAATG 
GAAAACATGAAAACAGCAATCTTCTTATGCTTCTGAATAATCAAAGACTAATTTGTGATTTT 
ACTTTTTAATAGATATGACTTTGCTTCCAACATGGAATGAAATAAAAAATAAATAATAAAAG 
ATTGCCATGAATCTTGCAAAA 



FIGURE 47 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA36343 
xsubunit 1 of 1, 289 aa, 1 stop 
><MW: 32268, pi: 9.21, NX(S/T): 0 

MWWVTGASSGIGEELAYQLSKLGVSLVLSARRVHELERVKRRCLENGNLKEKDILVLPLDL 
TDTGSHEAATKAVLQE FGRI D I LVNNGGMSQRSLCMDTS LDVYRKL I ELNYLGTVSLTKCVL 
PHMI ERKQGKI VTVNS ILGII S VPLS IGYCASKHALRGFFNGLRTELATYPGI I VSNI CPGP 
VQSNIVENSLAGEWKTIGNNGDQSHKMTTSRCWLMLISMANDLKEVWISEQPFLLVTYLW 
QYMPTWAWWI TNKMGKKRI ENFKSGVDADSS YFKI FKTKHD 

Important Features: 
Signal Peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 136-157 

Tyrosine kinase phosphorylation site. 

106-113 and 107-114 

Homologous region to Short-chain alcohol dehydrogenase 

amino acids 80-90, 131-168, 1-13. and 176-185 
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GCGACGTGGGCACCGCCATCAGCTGTTCGCGCGTCTTCTCCTCCAGGTGGGGCAGGGGTTTC 
GGGCTGGTGGAGCATGTGCTGGGACAGGACAGCATCCTCAATCAATCCAACAGCATATTCGG 
TTGCATCTTCTACACACTACAGCTATTGTTAGGTTGCCTGCGGACACGCTGGGCCTCTGTCC 
TGATGCTGCTGAGCTCCCTGGTGTCTCTCGCTGGTTCTGTCTACCTGGCCTGGATCCTGTTC 
TTCGTGCTCTATGATTTCTGCATTGTTTGTATCACCACCTATGCTATCAACGTGAGCCTGAT 
GTGGCTCAGTTTCCGGAAGGTCCAAGAACCCCAGGGCAAGGCTAAGAGGCACTGAGCCCTCA 
ACCCAAGCCAGGCTGACCTCATCTGCTTTGCTTTGGTCTTCAAGCCGCTCAGCGTGCCTGTG 
GACAGCGTGGCCCCGGCCCCCCCAAGCCTCAGGAGGGCAACACAGTCCCTGGCGAGTGGCCC 
TGGCAGGCCAGTGTGAGGAGGCAAGGAGCCCACATCTGCAGCGGCTCCCTGGTGGCAGACAC 
CTGGGTCCTCACTGCTGCCCACTGCTTTGAAAAGGCAGCAGCAACAGAACTGAATTCCTGGT 
CAGTGGTCCTGGGTTCTCTGCAGCGTGAGGGACTCAGCCCTGGGGCCGAAGAGGTGGGGGTG 
GCTGCCCTGCAGTTGCCCAGGGCCTATAACCACTACAGCCAGGGCTCAGACCTGGCCCTGCT 
GCAGCTCGCCCACCCCACGACCCACACACCCCTCTGCCTGCCCCAGCCCGCCCATCGCTTCC 
CCTTTGGAGCCTCCTGCTGGGCCACTGGCTGGGATCAGGACACCAGTGATGCTCCTGGGACC 
CTACGCAATCTGCGCCTGCGTCTCATCAGTCGCCCCACATGTAACTGTATCTACAACCAGCT 
GCACCAGCGACACCTGTCCAACCCGGCCCGGCCTGGGATGCTATGTGGGGGCCCCCAGCCTG 
GGGTGCAGGGCCCCTGTCAGGGAGATTCCGGGGGCCCTGTGCTGTGCCTCGAGCCTGACGGA 
CACTGGGTTCAGGCTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGGACGCTCCTGT 
GCTGCTGACCAACACAGCTGCTCACAGTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAGCTT 
TCCTGGCCCAGAGCCCAGAGACCCCGGAGATGAGTGATGAGGACAGCTGTGTAGCCTGTGGA 
TCCTTGAGGACAGCAGGTCCCCAGGCAGGAGCACCCTCCCCATGGCCCTGGGAGGCCAGGCT 
GATGCACCAGGGACAGCTGGCCTGTGGCGGAGCCCTGGTGTCAGAGGAGGCGGTGCTAACTG 
CTGCCCACTGCTTCATTGGGCGCCAGGCCCCAGAGGAATGGAGCGTAGGGCTGGGGACCAGA 
CCGGAGGAGTGGGGCCTGAAGCAGCTCATCCTGCATGGAGCCTACACCCACCCTGAGGGGGG 
CTACGACATGGCCCTCCTGCTGCTGGCCCAGCCTGTGACACTGGGAGC CAGC CTGCGGC CCC 
TCTGCCTGCCCTATCCTGACCACCACCTGCCTGATGGGGAGCGTGGCTGGGTTCTGGGACGG 
GCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCGTGACCCTCCTGGGGCCTAG 
GGCCTGCAGCCGGCTGCATGCAGCTCCTGGGGGTGATGGCAGCCCTATTCTGCCGGGGATGG 
TGTGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGGGCCTGTCTGGGGCACCACTGGTG 
CATGAGGTGAGGGGCACATGGTTCCTGGCCGGGCTGCACAGCTTCGGAGATGCTTGCCAAGG 
CCCCGCCAGGCCGGCGGTCTTCACCGCGCTCCCTGCCTATGAGGACTGGGTCAGCAGTTTGG 
ACTGGCAGGTCTACTTCGCCGAGGAACCAGAGCCCGAGGCTGAGCCTGGAAGCTGCCTGGCC 
AACATAAGCCAACCAACCAGCTGCTGACAGGGGACCTGGCCATTCTCAGGACAAGAGAATGC 
AGGCAGGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGTGATTCCAGGCAC 
CAGGGCAGGCCCAGAAGCCCAGCAGCTGTGGGAAGGAACCTGCCTGGGGCCACAGGTGCCCA 
CTCCCCACCCTGCAGGACAGGGGTGTCTGTGGACACTCCCACACCCAACTCTGCTACCAAGC 
AGGCGTCTCAGCTTTCCTCCTCCTTTACTCTTTCAGATACAATCACGCCAGCCACGTTGTTT 



AAAATAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA40571 

MLLSSLVSLAGSVYLAWILFFVLYDPCIVCITTYAINVSLMWLSFRKVQEPQGKAKRHGNTV 
PGEWPWQASVRRQGAHICSGSLVAD":[WLTAAHCFEKAAATELNSWSVVLGSLQREGLSPGA 
EEVGVAALQLPRAYNHYSQGSDLALLQLAHPTTHTPLCLPQPAHRFPFGASCWATGWDQDTS 
DAPGTLRNLRLRLISRPTCNCIYNQLHQRHLSNPARPGMLCGGPQPGVQGPCQGDSGGPVLC 
LEPDGHWVQAGI I S FAS S CAQEDAPVLLTNTAAHS S WLQARVQGAAFLAQS PETPEMSDEDS 
CVACGSLRTAGPQAGAPSPWPWEARLMHQGQLACGGALVSEEAVLTAAHCFIGRQAPEEWSV 
GLGTRPEEWGLKQLILHGAYTHPEGGYDMALLLLAQPVTLGASLRPLCLPYPDHHLPDGERG 
WVLGRARPGAGISSLQTVPVTLLGPRACSRLHAAPGGDGSPILPGMVCTSAVGELPSCEGLS 
GAPLVHEVRGTWFLAGLHSFGDACQGPARPAVFTALPAYEDWVSSLDWQVYFAEEPEPEAEP 
GSCLANISQPTSC 

Important features: 
Signal peptide: 

amino acids 1-15 

Homologous region to Serine proteases, trypsin family 

amino acids 79-95, 343-359 and 237-247 

N-glycosylation sites. 

amino acids 37-40 and 564-567 

Kringle domains 

amino acids 79-96, 343-360 and 235-247 
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CGGGCCGCCCCCGGCCCCCATTCGGGCCGGGCCTCGCTGCGGCGGCGACTGAGCCAGGCTGG 
GCCGCGTCCCTGAGTCCCAGAGTCGGCGCGGCGCGGCAGGGGCAGCCTTCCACCACGGGGAG 
CCCAGCTGTCAGCCGCCTCACAGGAAGATGCTGCGTCGGCGGGGCAGCCCTGGCATGGGTGT 
GCATGTGGGTGCAGCCCTGGGAGCACTGTGGTTCTGCCTCACAGGAGCCCTGGAGGTCCAGG 
TCCCTGAAGACCCAGTGGTGGCACTGGTGGGCACCGATGCCACCCTGTGCTGCTCCTTCTCC 
CCTGAGCCTGGCTTCAGCCTGGCACAGCTCAACCTCATCTGGCAGCTGACAGATACCAAACA 
GCTGGTGCACAGCTTTGCTGAGGGCCAGGACCAGGGCAGCGCCTATGCCAACCGCACGGCCC 
TCTTCCCGGACCTGCTGGCACAGGGCAACGCATCCCTGAGGCTGCAGCGCGTGCGTGTGGCG 
GACGAGGGCAGCTTCACCTGCTTCGTGAGCATCCGGGATTTCGGCAGCGCTGCCGTCAGCCT 
GCAGGTGGCCGCTCCCTACTCGAAGCCCAGCATGACCCTGGAGCCCAACAAGGACCTGCGGC 
CAGGGGACACGGTGACCATCACGTGCTCCAGCTACCAGGGCTACCCTGAGGCTGAGGTGTTC 
TGGCAGGATGGGCAGGGTGTGCCCCTGACTGGCAACGTGACCACGTCGCAGATGGCCAACGA 
GCAGGGCTTGTTTGATGTGCACAGCGTCCTGCGGGTGGTGCTGGGTGCGAATGGCACCTACA 
GCTGCCTGGTGCGCAACCCCGTGCTGCAGCAGGATGCGCACRGCTCTGTCACCATCACAGGG 
CAGCCTATGACATTCCCCCCAGAGGCCCTGTGGGTGACCGTGGGGCTGTCTGTCTGTCTCAT 
TGCACTGCTGGTGGCCCTGGCTTTCGTGTGCTGGAGAAAGATCAAACAGAGCTGTGAGGAGG 
AGAATGCAGGAGCTGAGGACCAGGATGGGGAGGGAGAAGGCTCCAAGACAGCCCTGCAGCCT 
CTGAAACACTCTGACAGCAAAGAAGATGATGGACAAGAAATAGCCTGACCATGAGGACCAGG 
GAGCTGCTACCCCTCCCTACAGCTCCTACCCTCTGGCTGCAATGGGGCTGCACTGTGAGCCC 
TGCCCCCAACAGATGCATCCTGCTCTGACAGGTGGGCTCCTTCTCCAAAGGATGCGATACAC 
AGACCACTGTGCAGCCTTATTTCTCCAATGGACATGATTCCCAAGTCATCCTGCTGCCTTTT 
TTCTTATAGACACAATGAACAGACCACCCACAACCTTAGTTCTCTAAGTCATCCTGCCTGCT 
GCCTTATTTCACAGTACATACATTTCTTAGGGACACAGTACACTGACCACATCACCACCCTC 

CTGACTGACCCCCTGCCTTATTTCACCAAAGACACGATGCATAGTCACCCCGGCCTTGTTTC 
TCCAATGGCCGTGATACACTAGTGATCATGTTCAGCCCTGCTTCCACCTGCATAGAATCTTT 
TCTTCTCAGACAGGGACAGTGCGGCCTCAACATCTCCTGGAGTCTAGAAGCTGTTTCCTTTC 
CCCTCCTTCCTCCCTGCCCCAAGTGAAGACAGGGCAGGGCCAGGAATGCTTTGGGGACACCG 
AGGGGACTGCCCCCCACCCCCACCATGGTGCTATTCTGGGGCTGGGGCAGTCTTTTCCTGGC 
TTGCCTCTGGCCAGCTCCTGGCCTCTGGTAGAGTGAGACTTCAGACGTTCTGATGCCTTCCG 
GATGTCATCTCTCCCTGCCCCAGGAATGGAAGATGTGAGGACTTCTAATTTAAATGTGGGAC 
TCGGAGGGATTTTGTAAACTGGGGGTATATTTTGGGGAAAATAAATGTCTTTGTAAAAAAAA 
AAAAAAAAAAAAAA 



FIGURE 51 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA41386 
xsubunit 1 of 1, 316 aa, 1 stop, 1 unknown 
><MW: -1, pi: 4.62, NX(S/T): 4 

MLRRRGSPGMGVHVGAALGALWFCLTGALEVQVPEDPWALVGTDATLCCSFSPEPGFSIAQ 

LNLIWQLTDTKQLVHSFAEGQDQGSAYANRTALFPDLLAQGNASLRLQRVRVADEGSFTCFV 

SIRDFGSAAVSLQVAAPYSKPSMTLEPNKDLRPGDTVTITCSSYQGYPEAEVFWQDGQGVPL 

TGNVTTSQMANEQGLFDVHSVLRWLGANGTYSCLVRNPVLQQDAHXSVTITGQPMTFPPEA 

LWVTVGLSVCLIALLVALAFVCWRKIKQSCEEENAGAEDQDGEGEGSKTALQPLKHSDSKED 
DGQEIA 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 251-270 

N-glycosylation site. 

amino acids 91-94, 104-107, 189-192 and 215-218 

Homologous region to Immunoglobulins and MHC 

amino acids 217-234 



FIGURE 52 



TTCGTGACCCTTGAGAAAAGAGTTGGTGGTAAATGTGCCACGTCTTCTAAGAAGGGGGAGTC 
CTGAACTTGTCTGAAGCCCTTGTCCGTAAGCCTTGAACTACGTTCTTAAATCTATGAAGTCG 
AGGGACCTTTCGCTGCTTTTGTAGGGACTTCTTTCCTTGCTTCAGCAAC ATGA GGCTTTTCT 
TGTGGAACGCGGTCTTGACTCTGTTCGTCACTTCTTTGATTGGGGCTTTGATCCCTGAACCA 
GAAGTGAAAATTGAAGTTCTCCAGAAGCCATTCATCTGCCATCGCAAGACCAAAGGAGGGGA 
TTTGATGTTGGTCCACTATGAAGGCTACTTAGAAAAGGACGGCTCCTTATTTCACTCCACTC 
ACAAACATAACAATGGTCAGCCCATTTGGTTTACCCTGGGCATCCTGGAGGCTCTCAAAGGT 
TGGGACCAGGGCTTGAAAGGAATGTGTGTAGGAGAGAAGAGAAAGCTCATCATTCCTCCTGC 
TCTGGGCTATGGAAAAGAAGGAAAAGGTAAAATTCCCCCAGAAAGTACACTGATATTTAATA 
TTGATCTCCTGGAGATTCGAAATGGACCAAGATCCCATGAATCATTCCAAGAAATGGATCTT 
AATGATGACTGGAAACTCTCTAAAGATGAGGTTAAAGCATATTTAAAGAAGGAGTTTGAAAA 
ACATGGTGCGGTGGTGAATGAAAGTCATCATGATGCTTTGGTGGAGGATATTTTTGATAAAG 
AAGATGAAGACAAAGATGGGTTTATATCTGCCAGAGAATTTACATATAAACACGATGAGTTA 
TAGAGATACATCTACCCTTTTAATATAGCACTCATCTTTCAAGAGAGGGCAGTCATCTTTAA 
AGAACATTTTATTTTTATACAATGTTCTTTCTTGCTTTGTTTTTTATTTTTATATATTTTTT 
CTGACTCCTATTTAAAGAACCCCTTAGGTTTCTAAGTACCCATTTCTTTCTGATAAGTTATT 
GGGAAGAAAAAGCTAATTGGTCTTTGAATAGAAGACTTCTGGACAATTTTTCACTTTCACAG 
ATATGAAGCTTTGTTTTACTTTCTCACTTATAAATTTAAAATGTTGCAACTGGGAATATACC 
ACGACATGAGACCAGGTTATAGCACAAATTAGCACCCTATATTTCTGCTTCCCTCTATTTTC 
TCCAAGTTAGAGGTCAACATTTGAAAAGCCTTTTGCAATAGCCCAAGGCTTGCTATTTTCAT 
GTTATAATGAAATAGTTTATGTGTAACTGGCTCTGAGTCTCTGCTTGAGGACCAGAGGAAAA 
TGGTTGTTGGACCTGACTTGTTAATGGCTACTGCTTTACTAAGGAGATGTGCAATGCTGAAG 
TTAGAAACAAGGTTAATAGCCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAG 
GCTGAGGCGGGCGGATCACCTGAGGTTGGGAGTTCGAGACCAGCCTGACCAACACGGAGAAA 
CCCTATCTCTACTAAAAATACAAAGTAGCCCGGCGTGGTGATGCGTGCCTGTAATCCCAGCT 
ACCCAGGAAGGCTGAGGCGGCAGAATCACTTGAACCCGAGGCCGAGGTTGCGGTAAGCCGAG 
ATCACCTNCAGCCTGGACACTCTGTCTCGAAAAAAGAAAAGAACACGGTTAATACCATATNA 
ATATGTATGCATTGAGACATGCTACCTAGGACTTAAGCTGATGAAGCTTGGCTCCTAGTGAT 
TGGTGGCCTATTATGATAAATAGGACAAATCATTTATGTGTGAGTTTCTTTGTAATAAAATG 
TATCAATATGTTATAGATGAGGTAGAAAGTTATATTTATATTCAATATTTACTTCTTAAGGC 
TAGCGGAATATCCTTCCTGGTTCTTTAATGGGTAGTCTATAGTATATTATACTACAATAACA 
TTGTATCATAAGATAAAGTAGTAAACCAGTCTACATTTTCCCATTTCTGTCTCATCAAAAAC 
TGAAGTTAGCTGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGGGCCAAGGAGGG 
TGGATCACTTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTCTA 
CTAAAAATACAAAAATTAGCCAGGCGTGGTGGTGCACACCTGTAGTCCCAGCTACTCGGGAG 
GCTGAGACAGGAGATTTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATTGTGCC 
ACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAGAAGCAGA 
CCTACAGCAGCTACTATTGAATAAATACCTATCCTGGATTTT 



FIGURE 53 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA44194 
xsubunit 1 of 1, 211 aa, 1 stop 
><MW: 24172, pi: 5.99, NX(S/T) : 1 

MRLFLWNAVLTLFVTS L I GALIPE PEVKI EVLQKPF I CHRKTKGGDLMLVHYEGYLEKDGSL 
FHSTHKHNNGQPIWFTLGILEALKGWDQGLKGMCVGEKRKLIIPPALGYGKEGKGKIPPEST 
LIFNIDLLEII^GPRSHESFQEiyroniSrDDWKLSKDEVKAYLKKEFEKHGAVVNESHHDALVED 
I FDKEDEDKDGFI S ARE FTYKHDEL 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 176-179 

Casein kinase II phosphorylation site. 

amino acids 143-146, 156-159, 178-181 and 200-203 

Endoplasmic reticulum targeting sequence, 
amino acids 208-211 

FKBP-type peptidyl -prolyl cis-trans isomerase 

amino acids 78-114 and 118-131 

EF-hand calcium-binding domain. 

amino acids 191-203, 184-203 and 140-159 

S-100/lCaBP type calcium binding domain 

amino acids 183-203 



FIGURE 54 



AATAAAGCTTCCTTAATGTTGTATATGTCTTTGAAGTACATCCGTGCATTTTTTTTTAGCAT 
CCAACCATTCCTCCCTTGTAGTTCTCGCCCCCTCAAATCACCCTCTCCCGTAGCCCACCCGA 
CTAACATCTCAGTCTCTGAAAATGCACAGAGATGCCTGGCTACCTCGCCCTGCCTTCAGCCT 
CACGGGGCTCAGTCTCTTTTTCTCTTTGGTGCCACCAGGACGGAGCATGGAGGTCACAGTAC 
CTGCCACCCTCAACGTCCTCAATGGCTCTGACGCCCGCCTGCCCTGCACCTTCAACTCCTGC 
TACACAGTGAACCACAAACAGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTC 
TGAGGAGATGTTCCTCCAGTTCCGCATGAAGATCATTAACCTGAAGCTGGAGCGGTTTCAAG 
ACCGCGTGGAGTTCTCAGGGAACCCCAGCAAGTACGATGTGTCGGTGATGCTGAGAAACGTG 
CAGCCGGAGGATGAGGGGATTTACAACTGCTACATCATGAACCCCCCTGACCGCCACCGTGG 
CCATGGCAAGATCCATCTGCAGGTCCTCATGGAAGAGCCCCCTGAGCGGGACTCCACGGTGG 
CCGTGATTGTGGGTGCCTCCGTCGGGGGCTTCCTGGCTGTGGTCATCTTGGTGCTGATGGTG 
GTCAAGTGTGTGAGGAGAAAAAAAGAGCAGAAGCTGAGCACAGATGACCTGAAGACCGAGGA 
GGAGGGCAAGACGGACGGTGAAGGCAACCCGGATGATGGCGCCAAGTAGTGGGTGGCCGGCC 
CTGCAGCCTCCCGTGTCCCGTCTCCTCCCCTCTCCGCCCTGTACAGTGACCCTGCCTGCTCG 
CTCTTGGTGTGCTTCCCGTGACCTAGGACCCCAGGGCCCACCTGGGGCCTCCTGAACCCCCG 
ACTTCGTATCTCCCACCCTGCACCAAGAGTGACCCACTCTCTTCCATCCGAGAAACCTGCCA 
TGCTCTGGGACGTGTGGGCCCTGGGGAGAGGAGAGAAAGGGCTCCCACCTGCCAGTCCCTGG 
GGGGAGGCAGGAGGCACATGTGAGGGTCCCCAGAGAGAAGGGAGTGGGTGGGCAGGGGTAGA 
GGAGGGGCCGCTGTCACCTGCCCAGTGCTTGCCTGGCAGTGGCTTCAGAGAGGACCTGGTGG 
GGAGGGAGGGCTTTCCTGTGCTGACAGCGCTCCCTCAGGAGGGCCTTGGCCTGGCACGGCTG 
TGCTCCTCCCCTGCTCCCAGCCCAGAGCAGCCATCAGGCTGGAGGTGACGATGAGTTCCTGA 
AACTTGGAGGGGCATGTTAAAGGGATGACTGTGCATTCCAGGGCACTGACGGAAAGCCAGGG 
CTGCAGGCAAAGCTGGACATGTGCCCTGGCCCAGGAGGCCATGTTGGGCCCTCGTTTCCATT 
GCTAGTGGCCTCCTTGGGGCTCCTGTTGGCTCCTAATCCCTTAGGACTGTGGATGAGGCCAG 
ACTGGAAGAGCAGCTCCAGGTAGGGGGCCATGTTTCCCAGCGGGGACCCACCAACAGAGGCC 
AGTTTCAAAGTCAGCTGAGGGGCTGAGGGGTGGGGCTCCATGGTGAATGCAGGTTGCTGCAG 
GCTCTGCCTTCTCCATGGGGTAACCACCCTCGCCTGGGCAGGGGCAGCCAAGGCTGGGAAAT 
GAGGAGGCCATGCACAGGGTGGGGCAGCTTTCTTTGGGGCTTCAGTGAGAACTCTCCCAGTT 
GCCCTTGGTGGGGTTTCCACCTGGCTTTTGGCTACAGAGAGGGAAGGGAAAGCCTGAGGCCG 
GCATAAGGGGAGGCCTTGGAACCTGAGCTGCCAATGCCAGCCCTGTCCCATCTGCGGCCACG 
CTACTCGCTCCTCTCCCAACAACTCCCTTCGTGGGGACAAAAGTGACAATTGTAGGCCAGGC 
ACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATTACCTCCAT 
CTGTTTAGTAGAAATGGGCAAAACCCCATCTCTACTAAAAATACAAGAATTAGCTGGGCGTG 
GTGGCGTGTGCCTGTAATCCCAGCTATTTGGGAGGCTGAGGCAGGAGAATCGCTTGAGCCCG 
GGAAGCAGAGGTTGCAGTGAACTGAGATAGTGATAGTGCCACTGCAATTCAGCCTGGGTGAC 
ATAGAGAGACTCCATCTCAAAAAAAA. 



FIGURE 55 

< /usr / s eqdb2 / s s t /DNA/Dna seqs . min/ s s . DNA4 5 4 1 5 
<subunit 1 of 1, 215 aa, 1 stop 
<MW: 24326, pi: 6.32, NX(S/T): 4 

MHRDAWLPRPAFSLTGLSLFFSLVPPGRSMEVTVPATLNVLNGSDARLPCTFNSCYTVNHKQ 
FSLNWTYQECNN CSEEMFLQFRMKI INLKLERFQDRVEFSGNPSKYDVSVMLRNVQPEDEGI 
YNCYIMNPPDRHRGHGKIHLQVLMEEPPERDSTVAVIVGASVGGFLAWILVLMWKCVRRK 
KEQKLSTDDLKTEEEGKTDGEGNPDDGAK 

Important features: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 161-179 

Immunoglobulin- like fold: 

amino acids 83-127 

N-glycosylation sites. 

amino acids 42-45, 66-69 and 74-77 



IT 



FIGURE 56 




GTTCTCGCCCCCTCAAATCACCTTCTCCCTTAGCCCACCCNACTAACATCTCAGTCTCTGAA 
AATGCACAGAGATGCCTGGCTACCTCGCCCTGCCTTCAGCCTCACGGGGCTCAGTCTCTTTT 
TCTCTTTGGTGCCACCAGGACGGAGCATGGAGGTCCACAGTACCTGNCCACCCTCAACGTCC 
TCAATGGCTCTGACGCCCGCCTGCCCTGCCCTTCAACTCCTGCTACACAGTGAACCACAAAC 
AGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTCTGAGGAGATGTTCCTCCAG 
TTCCGCATGAAGATCATTAACCTGAAGCTGGAGCGGTTTCAAGACCGCGTGGAGTTCTCAGG 
GAACCCCAGCAAGTACGATGTGTCGGTGATGCTGAGAAACGTGCAGCCGGAGGATGAGGGGA 
TTTACAACTGCTACATCATGAACCCCCC 



FIGURE 57 



TCACGGGGCTCATCTCTTTTTCTCTTTGGTGCCCACCAGGACGGAGCATGGAGGTNCACATA 
CCTGCCACCCTCAACGTCCTCAATGGCTTTGACGCCCGCCTGCCCTGCACCTTCAACTCCNG 
CTACACAGTGAACCACAAACAGTTCTCCCTGAACTGGATTTACCAGGAGTGCAACAACTGGC 
TCTGAGGAGATGTTCCTCCAGTTCCCGCATGGAAGATCATTTAACCTGAAAGCTGGAAGCGG 
TTTT C AAGAAC CGCGTGGAAGTTTCTCAGGGAACCCCAGCAAGTACGATGTGTCGGTGATGC 
TGAGAAACGTGCAGCCGGAGGATGAGGGGATTTACAACTGCTACATCATGAACCCCCC 



FIGURE 58 



TGCGGCGACCGTCGTACACCatTGGGCCTCCACCTCCGCCCCTACCGTGTGGGGCTGCTCCCGGATGGCCTCCTGT 
TCCTCTTGCTGCTGCTAATGCTGCTCGCGGACCCAGCGCTCCCGGCCGGACGTCACCCCCCAGTGGTGCTGGTCC 
CTGGTGATTTGGGTAACCAACTGGAAGCCMvGCTGGAC^GCCGACAGTGGTGCACTACCTCTGCTCCAAGAAGA 
CCGAAAGCTACTTCACAATCTGGCTGAACCTGGAACTGCTGCTGCCTGTCATCATTGACTGCTGGATTGACAATA 
TCAGGCTGGTTTACAACAAAACATCCAGGGCCACCCAGTTTCCTGATGGTGTGGATGTACGTGTCCCTGGCTTTG 
GGAAGACCTTCTCACTGGAGTTCCTGGACCCCAGCAAAAGCAGCGTGGGTTCCTATTTCCACACCATGGTGGAGA 
GCCTTGTGGGCTGGGGCTACACACGGGGTGAGGATGTCCXSAGGGGOTCCCTATGACTGGCGCCGAGCCCC^AATG 
AAAACGGGCCCTACTTCCTGGCCCTCCGCGAGATGATCGAGGAGATGTACCAGCTGTATGGGGGCCCCGTGGTGC 
TGGTTGCCCACAGTATGGGCAACATGTACACGCTCTACTTTCTGCAGCGGCAGCCGCAGGCCTGGAAGGACAAGT 
ATATCCGGGCCTTCGTGTCACTGGGTGCGCCCTGGGGGGGCGTGGCCAAGACCCTGCGCGTCCTGGCTTCAGGAG 
ACAACaACCGGATCCCAGTCATCGGGCCCCTGAAGATCCGGGAGCAGCAGCGGTCAGCTGTCTCCACCAGCTGGC 
TGCTGCCCTACAACTAC^CATGGTCACCTGAGAAGGTGTTCGTGCAGACACCCACAATCAACTACACACTGCGGG 
ACTACCGCAAGTTCTTCCAGGACATCGGCTTTGAAGATGGCTGGCTCATGCGGCAGGACACAGAAGGGCTGGTGG 
AAGCCACGATGCCACCTGGCGTGCAGCTGCACTGCCTCTATC 

ATGAGAGCTTCCCTGACCGTGACCCTAAAATCTGCTTTGGTGACGGCGATGGTACTGTGAACTTGAAGAGTGCCC 
TGC^GTGCC^GGCCTGGCAGAGCCGCCAGGAGCACCAAGTGTTGCTGCAGGAGCTGCCAGGCAGCGAGCACATCG 
AGATGCTGGCCAACGCCACCACCCTGGCCTATCTGAAACGTGTGCTCCTTGGGCC CTGA CTCCTGTGCCACAGGA 
CTCCTGTGGCTCGGCCGTGGACCTGCTGTTGGCCTCTGGGGCTGTCATGGCCCACGCGTTTTGCAAAGTTTGTGA 
CTCACCATTCAAGGCCCCGAGTCTTGGACTGTGAAGCATCTGCCATGGGGAAGTGCTGTTTGTTATCCTTTCTCT 
GTGGCAGTGAAGAAGGAAGAAATGAGAGTCTAGACTCAAGGGACACTGGATGGCAAGAATGCTGCTGATGGTGGA 
ACTGCTGTGACCTTAGGACTGGCTCCAC^GGGTGGACTGGCTGGGCCCTGGTCCCAGTCCCTGCCTGGGGCCATG 
TGTCCCCCTATTCCTGTGGGCTTTTCATACTTGCCTACTGGGCCCTGGCCCCGCAGCCTTCCTATGAGGGATGTT 
ACTGGGCTGTGGTCCTGTACCCAGAGGTCCCAGGGATCGGCTCCTGGCCCCTCGGGTGACCCTTCCCACACACCA 
GCCACAGATAGGCCTGCCACTGGTCATGGGTAGCTAGAGCTGCTGGCTTCCCTGTGGCTTAGCTGGTGGCCAGCC 
TGACTGGCTTCCTGGGCGAGCCTAGTAGCTCCTGCAGGCAGGGGCAGTTTGTTGCGTTCTTCGTGGTTCCCAGGC 
CCTGGGACATCTCACTCCACTCCTACCTCCCTTACCACC^ 

CCCCCAGTCCCGCAGGCTGTGTTCCAGGGGCCCTGATTTCCTCGGATGTGCTATTGGCCCCaGGACTGAAGCTGC 
CTCCCTTC^CCCTGGGACTGTGGTTCCIAAGGATGAGAGCAGGGGTTGGAGCCATGGCCTTCTGGGAACCTATGGA 
GAAAGGGAATCCAAGGAAGCAGCCAAGGCTGCTCGCAGCTTCCCTGAGCTGCACCTCTTGCTAACCCCACCATC^ 
CACTGCC^CCCTGCCCTAGGGTCTCACTAGTACCAAGTGGGT(^GCACAGGGCTGAGGATGGGGCTCCTATCCAC 
CCTGGCCAGCACCCAGCTTAGTGCTGGGACTAGCCCAGAAACTTGAATGGGACCCTGAGAGAGCCAGGGGTCCCC 
TGAGGCCCCCCTAGGGGCTTTCTGTCTGCCCCAGGGTGCTCCATGGATCTCCCTGTGGaVGCAGGCATGGAGAGT 
CAGGGCTGCCTTCATGGCAGTAGGCTCTAAGTGGGTGACTGGCCACAGGCCGAGAAAAGGGTACAGCCTCTAGGT 
GGGGTTCCCAAAGACGCCTTCAGGCTGGACTGAGCTGCTCT 

TTGCATAC^TGCCTGGCATCTGTCTCCCCTTGTTCCTGAGTGGCCCCACATGGGGCTCTGAGCAGGCTGTATCTG 
GATTCTGGCAATAAAAGTACTCTGGATGCTGTAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 59 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA44189 
xsubunit 1 of 1, 412 aa, 1 stop 
><MW: 46658, pi: 6.65, NX{S/T) : 4 

MGLHLRPYRVGLLPDGLLFLLLLLMLLADPALPAGRHPPVVLVPGDLGNQLEAKLDKPTVVH 
YLCSKKTESYFTIWLNLELLLPVIIDCWIDNIRLVYNKTSRATQFPDGVDVRVPGFGKTFSL 
EFLDPSKSSVGSYFHTMVESLVGWGYTRGEDVRGAPYDWRRAPNENGPYFLALREMIEEMYQ 
LYGGPWLVAHSMGNMYTLYFLQRQPQAWKDKYIRAFVSLGAPWGGVAKTLRVLASGDNNRI 
PVIGPLKIREQQRSAVSTSWLLPYNYTWSPEKVFVQTPTINYTLRDYRKFFQDIGFEDGWLM 
RQDTEGLVEATMPPGVQLHCLYGTGVPTPDSFYYESFPDRDPKICFGDGDGTVNLKSALQCQ 
AWQSRQEHQVLLQELPGSEHIEMLANATTLAYLKRVLLGP 

Important features : 
Signal peptide: 

amino acids 1-28 

Potential lipid substrate binding site: 
amino acids 147-164 

N-glycosylation sites. 

amino acids 99-102, 273-276, 289-292 and 398-401 

Lipases, serine proteins 

amino acids 189-201 

Beta-transducin family Trp-Asp repeat 

amino acids 353-365 



FIGURE 60 



CGGACGCGTGGGCGGACGCGTGGGGCGGCGGCAGCGGCGGCGACGGCGACATGGAGAGCGGG 
GCCTACGGCGCGGCCAAGGCGGGCGGCTCCTTCGACCTGCGGCGCTTCCTGACGCAGCCGCA 
GGTGGTGGCGCGCGCCGTGTGCTTGGTCTTCGCCTTGATCGTGTTCTCCTGCATCTATGGTG 
AGGGCTACAGCAATGCCCACGAGTCTAAGCAGATGTACTGCGTGTTCAACCGCAACGAGGAT 
GCCTGCCGCTATGGCAGTGCCATCGGGGTGCTGGCCTTCCTGGCCTCGGCCTTCTTCTTGGT 
GGTCGACGCGTATTTCCCCCAGATCAGCAACGCCACTGACCGCAAGTACCTGGTCATTGGTG 
ACCTGCTCTTCTCAGCTCTCTGGACCTTCCTGTGGTTTGTTGGTTTCTGCTTCCTCACCAAC 
CAGTGGGCAGTCACCAACCCGAAGGACGTGCTGGTGGGGGCCGACTCTGTGAGGGCAGCCAT 
CACCTTCAGCTTCTTTTCCATCTTCTCCTGGGGTGTGCTGGCCTCCCTGGCCTACCAGCGCT 
ACAAGGCTGGCGTGGACGACTTCATCCAGAATTACGTTGACCCCACTCCGGACCCCAACACT 
GCCTACGCCTCCTACCCAGGTGCATCTGTGGACAACTACCAACAGCCACCCTTCACCCAGAA 
CGCGGAGACCACCGAGGGCTACCAGCCGCCCCCTGTGTACTGAGTGGCGGTTAGCGTGGGAA 
GGGGGACAGAGAGGGCCCTCCCCTCTGCCCTGGACTTTCCCATCAGCCTCCTGGAACTGCCA 
GCCCCTCTCTTTCACCTGTTCCATCCTGTGCAGCTGACACACAGCTAAGGAGCCTCATAGCC 
TGGCGGGGGCTGGCAGAGCCACACCCCAAGTGCCTGTGCCCAGAGGGCTTCAGTCAGCCGCT 
CACTCCTCCAGGGCACTTTTAGGAAAGGGTTTTTAGCTAGTGTTTTTCCTCGCTTTTAATGA 
CCTCAGCCCCGCCTGCAGTGGCTAGAAGCCAGCAGGTGCCCATGTGCTACTGACAAGTGCCT 
CAGCTTCCCCCCGGCCCGGGTCAGGCCGTGGGAGCCGCTATTATCTGCGTTCTCTGCCAAAG 
ACTCGTGGGGGCCATCACACCTGCCCTGTGCAGCGGAGCCGGACCAGGCTCTTGTGTCCTCA 
CTCAGGTTTGCTTCCCCTGTGCCCACTGCTGTATGATCTGGGGGCCACCACCCTGTGCCGGT 
GGCCTCTGGGCTGCCTCCCGTGGTGTGAGGGCGGGGCTGGTGCTCATGGCACTTCCTCCTTG 
CTCCCACCCCTGGCAGCAGGGAAGGGCTTTGCCTGACAACACCCAGCTTTATGTAAATATTC 
TGCAGTTGTTACTTAGGAAGCCTGGGGAGGGCAGGGGTGCCCCATGGCTCCCAGACTCTGTC 
TGTGCCGAGTGTATTATAAAATCGTGGGGGAGATGCCCGGCCTGGGATGCTGTTTGGAGACG 
GAATAAATGTTTTCTCATTCAAAG 



IP 



FIGURE 61 

</usr/ seqdb2/sst/DNA/Dnasegs .min/ss .DNA483 04 
<subunit 1 of 1, 224 aa, 1 stop 
<MW: 24810, pi: 4.75, NX(S/T): 1 

MES GAYGAAKAGGSFDLRRFLTQPQVVARAVCLVFAL I VFSC I YGEGYSNAHE SKQMYCVFN 
RNEDACRYGS AI GVLAFLASAFFL VVDAYFPQ I SNATDRKYLVI GDLLFS ALWTFLWFVGFC 
FLTNQWAVTNPKDVLVGADSVRAAITFSFFSIFSWGVIASIAYQRYKAGVDDFIQKTYVDPTP 
DPNTAYASYPGASVDNYQQPPFTQNAETTEGYQPPPVY 

Important features: 

Type II Transmembrane domain: 

amino acids 1-45 

Other transmembrane domains: 

amino acids 74-90, 108-126 and 145-161 

N-glycosylation site. 

amino acids 97-100 



FIGURE 62 

GAGCC^CCTACCCTGCTCCGAGGCCAGGCCTGCAGGGCCTCATCGGCCAGAGGGTGATCAGTGAGCAGAAGGATG 

CCCGTGGCCGAGGCCCCCCAGGTGGCTGGCGGGCAGGGGGACGGAGGTGATGGCGAGGAAGCGGAGCCAGAGGGG 

ATGTTCAAGGCCTGTGAGGACTCCAAGAGAAAAGCCCGGGGCTACCTCCGCCTGGTGCCCCTGTTTGTGCTGCTG 

GCCCTGCTCGTGCTGGCTTCGGCGGGGGTGCTACTCTGGTATTTCCTAGGGTACAAGGCGGAGGTGATGGTCAGC 

CaGGTGTACTCAGGCAGTCTGCGTGTACTCAATCGCCACTTCTCCCAGGATCTTACCCGCCGGGAATCTAGTGCC 

TTCCGCAGTGAAACCGCCAAAGCCCAGAAGATGCTCAAGGAGCTCATCACCAGCACCCGCCTGGGA 

AACTCCAGCTCCGTCTATTCCTTTGGGGAGGGACCCCTCACCTGCTTCTTCTGGTTCATTCTCCAAATCCCCGAG 

CACCGCCGGCTGATGCTGAGC C CCGAGGTGGT GCAGGCACTGC TGGTGGAGGAGCTG CTGTCCACAGT CAACAGC 

TCGGCTGCCGTCCCCTACAGGGCCGAGTACGAAGTGGACCCCGAGGGCCTAGTGATCCTGGAAGCCAGTGTGAAA 

GACATAGCTGCATTGAATTCCACGCTGGGTTGTTACCGCTACAGCTACGTGGGCCAGGGCCAGGTCCTCCGGCTG 

AAGGGGCCTGACCACCTGGCCTCCAGCTGCCTGTGGCACCTGCAGGGCCCCAAGGACCTCATGCTCAAACTCCGG 

CTGGAGTGGACGCTGGCAGAGTGCCGGGACCGACTGGCCATGTATGACGTGGGCGGGCCCCTGGAGAAGAGGCTC 

ATCACCTCGGTGTACGGCTGCAGCCGCCAGGAGCCCGTGGTGGAGGTTCTGGCGTCGGGGGCCATCIATGGCGGTC 

GTCTGGAAGAAGGGCCTGCACAGCTACTACGACCCCTTCGTGCTCTCCGTGCAGCCGGTGGTCTTCCAGGCCTGT 

GAAGTGAACCTGACGCTGGACAACAGGCFCGACTCCCAGGGCGTCCTCAGC!ACCCCGTACTTCCCCAGCTACTAC 

TCGCCCCAAACCCACTGCTCCTGGCACCTCACGGTGCCCTCTCTGGACTACGGCTTGGCCCTCTGGTTTGATGCC 

TATGCACTGAGGAGGCAGAAGTATGATTTGCCGTGCACCCAGGGCCAGTGGACGATCCAGAACAGGAGGCTGTGT 

GGCTTGCGCATCCTGC^GCCCTACGCCGAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACC 

TCCCAGATCTCCCTCACCGGGCCCGGTGTGCGGGTGCACTATGGCTTGTACAACCAGTCGGACCCCTGCCCTGGA 

GAGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGGACTGCCCCAACGGCCTGGAT 

GAGAGAAACTGCGTTTGC^GAGCCACATTCCAGTGCAAAGAGGAC^ 

GATGGGCAGCCTGATTGTCTCAACGGCAGCGATGAAGAGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACC 
TTCC^GTGTGAGGACCGGAGCTGCGTGAAGAAGCCCAACCCGCAGTGTGATGGGCGGCCCGACTGCAGGGACGGC 
TCGGATGAGGAGCACTGTGACTGTGGCCTCCAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGTCCTCCGAG 
GGTGAGTGGCCATGGCAGGCCAGCCTCCAGGTTCGGGGTCGACACATCTGTGGGGGGGCCCTCATCGCTGACCGC 
TGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCATGGCCTCCACGGTGCTGTGGACCGTGTTCCTGGGC 
AAGGTGTGGCAGAACTCGCGCTGGCCTGGAGAGGTGTCCTTCAAGGTGAGCCGCCTGCTCCTGCACCCGTACCAC 
GAAGAGGACAGCCATGACTACGACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGC 
CCCGTCTGCCTGCCCGCGCGCTCCCACTTOTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCTGGGGCGCCTTG 
CGCGAGGGCGGCCCCATCAGCAACGCTCTGC^GAAAGTGGATGTGCAGTTGATCCCACAGGACCTGTGCAGCGAG 
GCCTATCGCTACCAGGTGACGCCACGCATGCTGTGTGCCGGCTACCGCAAGGGCAAGAAGGATGCCTGTCAGGGT 
GACTCAGGTGGTCCGCTGGTGTGCAAGGCACITCAGTGGCCGCTGGTTCCTGGCGGGGCTGGTCAGCTGGGGCCTG 
GGCTGTGGCCGGCCTAACTACTTCGGCGTCTACACCCGC^TCACAGGTGTGATCAGCTGGATCCAGC^GTGGTG 
ACC^GGAACTGCCCCCCTGCAAAGCAGGC4CCCACCTCCTGGACTC^GAGAGCCCAGGGC^CTGCC^AGCAGG 
GGGACAAGTATTCTGGCGGGGGGTGGGGGAGAGAGCAGGCCCTGTGGTGGCAGGAGGTGGCATCTTGTCTCGTCC 
CTGATGTCTGCTCCAGTGATGGCAGGAGGArGGAGAAGTGCCAGCAGCTGGGGGTCAAGACGTCCCCTGAGGACC 
C^GGCCCAC^CCC^GCCCTTCTGCCTCCC^ 

GCAGTGGCTCaGCaGCAAGAATGCTGGTTCTACATCCCGAGGAGTGTCTGAGGTGCGCCCCACTCTGTACAGAGG 
CTGTTTGGGCAGCCTTGCCTCCAGAGAGCAGATTCCAGCTTCGGAAGCCCCTGGTCTAACTTGGGATCTGGGAAT 
GGAAGGTGCTCCCATCGGAGGGGACCCTCAGAGCCCTGGAGACTGCCAGGTGGGCCTGCTGCCACTGTAAGCCAA 
AAGGTGGGGAAGTCCTGACTCCAGGGTCCTTGCCCCACCCCTGCCTGCCACCTGGGCCCTCACAGCCCAGACCCT 
CACTGGGAGGTGAGCTCAGCTGCCCTTTGGAATAAAGCTGCCTGATGAAAAAAAAAAAAAAAAAAAAA 



FIGURE 63 



>< /usr / s eqdb2 / s s t /DNA/Dnaseqs .min/ss. DNA4 9152 
xsubunit 1 of 1, 802 aa, 1 stop 
><MW: 88846, pi: 6.41, NX(S/T): 7 

MPVAEAPQVAGGQGDGGDGEEAEPEGMFKACEDSKRKARGYLRLVPLFVLL^LVLASAGVL 
LWYFLGYKAEVMVSQVYSGSLRVLNRHFSQDLTRRESSAFRSETAKAQKMLKELITSTRLGT 
YYNSSSVYS FGEGPLTCFFWF I LQ I PEHRRLMLSPEWQALLVEELLSTWSSAAVPYRAEY 
EVDPEGLVILEASVKDIAALNSTLGCYRYSYVGQGQVLRLKGPDHLASSCLWHLQGPKDLML 
KLRLEWTLAECRDRLAMYDVAGPLEEIRLITSVYGCSRQEPVVEVI^SGAIMAVVWKKGLHSY 
YD P F VL S VQ PVVFQACEVNLTLDNRLDSQGVLSTPYFPS YYS PQTHC S WHLTVPS LDYGLAL 
WFDAYALRRQKYDLPCTQGQWTIQNRRLCGLRILQPYAERIPWATAGITINFTSQISLTGP 
GVRVHYGLYNQSD PC PGE FLC SVNGLC VPACDGVKD CPNGLDERNCVCRATFQCKEDSTC I S 
LPKVCDGQPDCLNGSDEEQCQEGVPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCD 
CGLQGPSSRIVGGAVSSEGEWPWQASLQVRGRHICGGALIADRWVITAAHCFQEDSMASTVL 
WTVFLGKVWQNSRWPGEVSFKVSRLLLHPYHEEDSHDYDVALLQLDHPVVRSAAVRPVCLPA 
RSHFFEPGLHCWITGWGALREGGPISNALQKVDVQLIPQDLCSEAYRYQVTPRMLCAGYRKG 
KKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGVISWIQQWT 

Important features : 

Type II transmembrane domain: 

amino acids 46-67 

Serine proteases, trypsin family, histidine active site, 
amino acids 604-609 
N-glycosylation sites. 

amino acids 127-130, 175-178, 207-210, 329-332, 424-427, 444-447 

and 509-512 

Kr ingle domains. 

amino acids 746-758 and 592-609 

Homologous region to Kallikrein Light Chain: 

amino acids 568-779 

Homologous region to Low-density lipoprotein receptor: 

amino acids 451-567 



GCACCCAGGGCCAGTGGACGATCCAGAACAGGAGGCTGTGTGGCTTGCGCATCCTGCAGCCC 
TACGCCGAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACCTCCCAGAT 
CTCCCTCACCGGGCCCGGTGTGCGGGTGCACTATGGCTTGTACAACCAGTCGGACCCCTGCC 
CTGGAGAGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGGAC 
TGCCCCAACGGCCTGGATGAGAGAAACTGCGTTTGCAGAGCCACATTCCAGTGCAAAGAGGA 
CAGCACATGCATCTCACTGCCCAAGGTCTGTGATGGGCAGCCTGATTGTCTCAACGGCAGCG 
ATGAAGAGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACCTTCCAGTGTGAGGACCGG 
AGCTGCGTGAAGAAGCCCAACCCGCAGTGTGATGGGCGGCCCGACTGCAGGGACGGCTCGGA 
TGAGGAGCACTGTGACTGTGGCCTCCAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGT 
CCTCCGAGGGTGAGTGGCCATGGCAGGCCAGCCTCCAGGTTCGGGGTCGACACATCTGTGGG 
GGGGCCCTCATCGCTGACCGCTGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCAT 
GGCCTCCACGGTGCTGTGGACCGTGTTCCTGGGCAAGGTGTGGCAGAACTCGCGCTGGCCTG 
GAGAGGTGTCCTTCAAGGTGAGCCGCCTGCTCCTGCACCCGTACCACGAAGAGGACAGCCAT 
GACTACGACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGCCC 
CGTCTGCCTGCCCGCGCGCTCCCACTTCTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCT 
GGGGCGCCTTGCGCGAGGGCGGCCCCATCAGCAACGCTCTGCAGAAAGTGGATGTGCAGTTG 
ATCCCACAGGACCTGTGCAGCGAGGCCTATCGCTACCAGGTGACGCCACGCATGCTGTGTGC 
CGGCTACCGCAAGGGCAAGAAGGATGCCTGTCAGGGTGACTCAGGTGGTCCGCTGGTGTGCA 
AGGCACTCAGTGGCCGCTGGTTCCTGGCGGGGCTGGTCAGCTGGGGCCTGGGCTGTGGCCGG 
CCTAACTACTTCGGCGTCTACACCCGCATCACAGGTGTGATCAGCTGGATCCAGCAAGTGGT 
GACCTGAGGAACTGCCCCCCTGCAAAGCAGGGCCCACCTCCTGGACTCAGAGAGCCCAGGGC 
AACTGCCAAGCAGGGGGACAAGTAT 



FIGURE 65 



GGACGAGGGCAGATCTCGTTCTGGGGCAAGCCGTTGACACTCGCTCCCTGCCACCGCCCGGG 
CTCCGTGCCGCCAAGTTTTCATTTTCCACCTTCTCTGCCTCCAGTCCCCCAGCCCCTGGCCG 
AGAGAAGGGTCTTACCGGCCGGGATTGCTGGAAACACCAAGAGGTGGTTTTTGTTTTTTAAA 
ACTTCTGTTTCTTGGGAGGGGGTGTGGCGGGGCAGGATGAGCAACTCCGTTCCTCTGCTCTG 
TTTCTGGAGCCTCTGCTATTGCTTTGCTGCGGGGAGCCCCGTACCTTTTGGTCCAGAGGGAC 
GGCTGGAAGATAAGCTCCACAAACCCAAAGCTACACAGACTGAGGTCAAACCATCTGTGAGG 
TTTAACCTCCGCACCTCCAAGGACCCAGAGCATGAAGGATGCTACCTCTCCGTCGGCCACAG 
CCAGCCCTTAGAAGACTGCAGTTTCAACATGACAGCTAAAACCTTTTTCATCATTCACGGAT 
GGACGATGAGCGGTATCTTTGAAAACTGGCTGCACAAACTCGTGTCAGCCCTGCACACAAGA 
GAGAAAGACGCCAATGTAGTTGTGGTTGACTGGCTCCCCCTGGCCCACCAGCTTTACACGGA 
TGCGGTCAATAATACCAGGGTGGTGGGACACAGCATTGCCAGGATGCTCGACTGGCTGCAGG 
AGAAGGACGATTTTTCTCTCGGGAATGTCCACTTGATCGGCTACAGCCTCGGAGCGCACGTG 
GCCGGGTATGCAGGCAACTTCGTGAAAGGAACGGTGGGCCGAATCACAGGTTTGGATCCTGC 
CGGGCCCATGTTTGAAGGGGCCGACATCCACAAGAGGCTCTCTCCGGACGATGCAGATTTTG 
TGGATGTCCTCCACACCTACACGCGTTCCTTCGGCTTGAGCATTGGTATTCAGATGCCTGTG 
GGCCACATTGACATCTACCCCAATGGGGGTGACTTCCAGCCAGGCTGTGGACTCAACGATGT 
CTTGGGATCAATTGCATATGGAACAATCACAGAGGTGGTAAAATGTGAGCATGAGCGAGCCG 
TCCfiCCTCTTTGTTGACTCTCTGGTGAATCAGGACAAGCCGAGTTTTGCCTTCCAGTGCACT 
GACTCCAATCGCTTCAAAAAGGGGATCTGTCTGAGCTGCCGCAAGAACCGTTGTAATAGCAT 
TGGCTACAATGCCAAGAAAATGAGGAACAAGAGGAACAGCAAAATGTACCTAAAAACCCGGG 
CAGGCATGCCTTTCAGAGGTAACCTTCAGTCCCTGGAGTGTCCCTGAGGAAGGCCCTTAATA 
CCTCCTTCTTAATACCATGCTGCAGAGCAGGGCACATCCTAGCCCAGGAGAAGTGGCCAGCA 
CAATCCAATCAAATCGTTGCAAATCAGATTACACTGTGCATGTCCTAGGAAAGGGAATCTTT 
ACAAAATAAAC^GTGTGGACCCCTAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA 



FIGURE 66 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA49646 
xsubunit 1 of 1, 354 aa, 1 stop 
><MW: 39362, pi: 8.35, NX(S/T): 2 

MSNSVPLLCFWSLCYCFAAGSPVPFGPEGRLEDKLHKPKATQTEVKPSVRFNLRTSKDPEHE 
GCYLSVGHSQPLEDCSFNMTAKTFFIIHGWTMSGIFENWLHKLVSALHTREKDANVVVVDWL 
PLAHQLYTDAVNNTRWGHSIARMLDWLQEKDDFSLGlT^LIGYSLGAEtVAGYAGNFVKGW 
GRITGLDPAGPMFEGADIHKRLSPDDADFVDVLHTYTRSFGLSIGIQMPVGHIDIYPNGGDF 
QPGCGLNDVLGSIAYGTITEWKCEHERAVHLFVDSLVNQDKPSFAFQCTDSNRFKKGICLS 
CRKNRCNS IGYNAKKMRNKRNSKMYLKTRAGMPFRGNLQSLECP 

Important features : 
Signal peptide: 

amino acids 1-16 

Lipases, serine active site. 

amino acids 163-172 

N-glycosylation sites. 

amino acids 80-83 and 136-139 



FIGURE 67 

CGGACGCGTGGGCGGACGCGTGGGCCTGGGCAAGGGCC03GGCGCCGGGCCGAGCCACCTCTTCCCCTCCCCCGC 
TTCCCTGTCGCGCTCCGCTGGCTGGACGCGCTGGAGGAGTGGAGCAGCACCCGGCCGGCCCTGGGGGCTQAC!AGT 
CGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAGGTGGCGACCAGGCCAGACCAGGGGCGCTCG 
CTGCCTGCGGGCGGGCTGTAGGCGAGGGCGCGCCCCAGTGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAG 
AGJ^GAGTGCGGCGGCGGACGGAGAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTGCCG 
CCGCCTCCCCGCCCCOVGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGCCCCCCTGGGGAGGCGG 
CACCAGGGAGCCTGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGGTAGACCACAGAAGCTCCGGGACCCTTCCG 
GCACCTCTGGA<^GCCCAGGATGCTGTTGGCCACCCTCCTCCTCCTCCTCCTTGGAGGCGCTCTGGCCCATCCAG 
ACCGGATTATTTTTCCAAATC^TGCTTGTGAGGACCCCCCAGCAGTGCTCTTAGAAGTGCAGGGCACCTTACAGA 
GGCCCCTGGTCCGGGACAGCCGCACCTCCCCTGCCAACTGCACCTGGCTCATCCTGGGCAGCAAGGAACAGACTG 
TCACCATCAGGTTCCAGAAGCTACACCTGGCCTGTGGCTCAGAGCGCTTAACCCTACGCTCCCCTCTCCAGCCAC 
TGATCTCCCTGTGTGAGGCACCTCCCAGCC^^ 

CTGGGGCCAGAGC^CCCATGGGCCAGGGCTTCCTGCTCTCCTACAGCCAAGATTGGCTGATGTGCCTGCAGGAAG 
AGTTTCAGTGCCTGAACCACCGCTGTGTATCTGCTGTCCAGCGCTGTGATGGGGTTGATGCCTGTGGCGATGGCT 
CTGATGAAGCAGGTTGCAGCTC^GACCCCT07CCCTGGCCTGACCCC^GACCCGTCCCCTCCCTGCCTTGCAATG 
TCACCTTGGAGGACTTCTATGGGGTCTTCTCCTCTC^ 

CCTGCCATTGGCTGCTGGACCCCCATGATGGCCGGCGGCTGGCCGTGCGCTTCACAGCCCTGGACTTGGGCTTTG 
GAGATGCAGTGCATGTGTATGACGGCCCTGGGCCCCCTGAGAGCTCCCGACTACTGCGTAGTCTCACCCACTTCA 
GCAATGGCAAGGCTGTCACTGTGGAGACACTGTCTGGCCAGGCT^ 

ATGGTCGTGGCTTCAATGCCACCTACGMGTGCGGGGCTATTGCTTGCCTTGGGACAGACCCTGTGGCTTAGGCT 
CTGGCCTGGGAGCTGGCGAAGGCCTAGGTGAGCGCTGCTACAGTGAGGCACAGCGCTGTGACGGCTCATGGGACT 
GTGCTGACGGCAC^GATGAGGAGGACTGCCCAGGCTGCCCACCTGGACACTTCCCCTGTGGGGCTGCTGGCACCT 
CTGGTGCCACAGCCTGCTACCTGCCTGCTGACCGCTGCAACTACCAGACTTTCTGTGCTGATGGAGCAGATGAGA 
GACGCTGTCGGGATTGCCAGCCTGGCAATTTCCGATGCCGGGACGAGAAGTGCGTGTATGAGACGTGGGTGTGCG 
ATGGGCAGCC^GACTGTGCGGACGGCAGTGATGAGTGGGACTGCTCCTATGTTCTGCCCCGCAAGGTCATTACAG 
CTGCAGTCATTGGCAGCCTAGTGTGCGGCCTGCTCCTGGTCATCGCCCTGGGCTGCACCTGCAAGCTCTATGCCA 
TTCGCACCCAGGAGTACAGCATCTTTGCCCCCCTCTCCCGGATGGAGGCTGAGATTGTGCAGCAGCAGG(^CCCC 
CTTCCTACGGGCAGCTCATTGCCCAGGGTGCCATCCCACCTGTAGAAGACTTTCCTACAGAGAATCCTAATGATA 
ACTCAGTGCTGGGC^CCTGCGTTCTCTGCTACAGATCTTACGCCAGGATATGACTCCAGGAGGTGGCCCAGGTG 
CCCGCCGTCGTCAGCGGGGCCGCTTGATGCGACGCCTGGTACGCCGTCTCCGCCGCTGGGGCTTGCTCCCTCGAA 
CCAAC&CCCCGGCTCGGGCCTCTGAGGCCAGATCC^ 

GTGGCACAGGTCCAGCCCGTGAGGGCGGGGCAGTGGGTGGGCAAGATGGGGAGCAGGCACCCCCACTGCCCATCA 
AGGCTCCCCTCCCATCTGCTAGCACGTCTCCAGCCCCCACTACTGTCCCTGAAGCCCCAGGGCCACTGCCCTCAC 
TGCCCCTAGAGCCATCACTATTGTCTGGAGTGGTGCAGGCCCTGCGAGGCCGCCTGTTGCCCAGCCTGGGGCCCC 
CAGGACCAACCCGGAGCCCCCCTGGACCC^CACAGCAGTCCTGGCCCTGGAAGATGAGGACGATGTGCTACTGG 
TGCCACTGGCTGAGCCGGGGGTGTGGGTAGCTGAGGCAGAGGATGAGCCACTGCTTACCTGAGGGGACCTGGGGG 
CTCTACTGAGGCCTCTCCCCTGGGGGCTCTACTCATAGTGGCACAACCTTTTAGAGGTGGGTCAGCCTCCCCTCC 
ACCACTTCCTTCCCTGTCCCTGGATTTCAGGGACTTGGTGGGCCTCCCGTTGACCCTATGTAGCTGCTATAAAGT 
TAAGTGTCCCTCAGGCAGGGAGAGGGCTCACAGAGTCTCCTCTGTACGTGGCCATGGCCAGACACCCCAGTCCCT 
TCACCACCACCTGCTCCCCACGCCACCACCATTTGGGTGGCTGTTTTTAAAAAGTAAAGTTCTTAGAGGATCATA 
GGTCTGGACACTCC^TCCTTGCCAAACCTCTACCCAAAAGTGGCCTTAAGCACCGGAATGCCAATTAACTAGAGA 
CCCTCCAGCCCCCAAGGGGAGGATTTGGGCAGAACCTGAGGTTTTGCCATCCACAATCCCTCCTACAGGGCCTGG 
CTCACAAAAAGAGTGCAACAAATGCTTCTATTCCATAGCTACGGCATTGCTCAGTAAGTTGAGGTCAAAAATAAA 
GGAATCATACATCTC 



FIGURE 68 

</usr/seqdb2/sst/DNA/Dnase;qs.min/ss.DNA49631 
<subunit 1 of 1, 713 aa, 1 stop 
<MW: 76193, pi: 5.42, NX(S/T) : 4 

MLLATLLLLLLGGALAHPDRI I FPNHACEDPPAVLLEVQGTLQRPLVRDSRTSPANCTWLIL 
GSKEQTVTIRFQKLHLACGSERLTLRSPLQPLISLCEAPPSPLQLPGGNVTITYSYAGARAP 
MGQGFLLSYSQDWLMCLQEEFQCLNHRCVSAVQRCDGVDACGDGSDEAGCSSDPFPGLTPRP 
VPSLPCNVTLEDFYGVFSSPGYTHLASVSHPQSCHWLLDPHDGRRLAVRFTALDLGFGDAVH 
VYDGPGPPESSRLLRSLTHFSNGKAArrVETLSGQAWSYHTVAWSNGRGFNATYHVRGYCLP 
WDRPCGLGSGLGAGEGLGERCYSEAQRCDGSWDCADGTDEEDCPGCPPGHFPCGAAGTSGAT 
ACYLPADROSryQTFCADGADERRCRHCQPGNFRCRDEKCVYETWVCDGQPDCADGSDEWDCS 
YVLPRKVITAAVIGSLVCGLLLVIALGCTCKLYAIRTQEYSIFAPLSRMEAEIVQQQAPPSY 
GQLIAQGAIPPVEDFPTENPNDNSVLGNLRSLLQILRQDMTPGGGPGARRRQRGRLMRRLVR 
RLRRWGLLPRTNTPARASEARSQVTPSAAPLEALDGGTGPAREGGAVGGQDGEQAPPLPIKA 
PLPSASTSPAPTTVPEAPGPLPSLPLEPSLLSGWQALRGRLLPSLGPPGPTRSPPGPHTAV 
LALEDEDDVLLVPLAEPGVWVAEAEDEPLLT 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 442-462 

LDL-receptor class A (LDLRA) domain proteins 

amino acids 411-431, 152-171, 331-350 and 374-393 
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CGAGCTGGGCGAGAAGTAGGGGAGGGCGGTGCTCCGCCGCGGTGGCGGTTGCTATCGCTTCG 
CAGAACCTACTCAGGCAGCCAGCTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCA 
GACGCGATGGATAACGTGCAGCCGAAAATAAAACATCGCCCCTTCTGCTTCAGTGTGAAAGG 
CCACGTGAAGATGCTGCGGCTGGCACTAACTGTGACATCTATGACCTTTTTTATCATCGCAC 
AAGCCCCTGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATA 
CTTTTATATGTACTCAGACTTGATCGATTAATGAAGTGGTTATTTTGGCCTTTGCTTGATAT 
TATCAACTCACTGGTAACAACAGTATTCATGCTCATCGTATCTGTGTTGGCACTGATACCAG 
AAACCACAACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGACAGCAGTATGCTGTCTT 
GCCGACGGGGCCCTTATTTACCGGAAGCTTCTGTTCAATCCCAGCGGTCCTTACCAGAAAAA 
GCCTGTGCATGAAAAAAAAGAAGTTTTGTAATTTTATATTACTTTTTAGTTTGATACTAAGT 
ATTAAACATATTTCTGTATTCTTCCAAAAAAAAAAAAAAAAAA 



FIGURE 70 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA49645 
xsubunit 1 of 1, 152 aa, 1 stop 
><MW: 17170, pi: 9.62, NX(S/T) : 1 

mdnvqpkikhrpfcfsvkghvkmlr:laltvtsmtffiiaqapepyivitgfevtvilffill 
yvlrldrlmkwlfwplldi inslvttwmli vsvlali petttltvgggvfalvtavcclad 
galiyrkllfnpsgpyqkkpvhekkevl 

Important features: 

Potential type II transmembrane domain: 

amino acids 26-42 

Other potential transmembrane domain: 

amino acids 44-65, 81-101 and 109-129 

Leucine zipper pattern 

amino acids 78-99 and 85-106 

N-myristoylation site. 

amino acids 110-115 

Ribonucleotide reductase large subunit protein 

amino acids 116-127 
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GGGCGAGAAGTAGGGGAGGGCGTGTTCCGCCGCGGTGGCGGTTGCTATCGTTTTGCAGAACC 
TACTCAGGCAGCCAGNTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCAGACGCGA 
TGGATAACGTGCAGCCGAAAATAAAACATCGCCCCTTCTGCTTCAGTGTGAAAGGCCACGTG 
AAGATGCTGCGGCTGGCACTAACTGNGACATCTATGACCTTTTTTATNATCGCACAAGCCCC 
TGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATACTTTTAT 
ATGTACTCAGACTTGATCGATTAATGAAGTGGTTATTTTGGCCTTTGCTTGATATTATCAAC 
TCACTGGTAACAACAGTATTCATGCTCATCGTATCTGTGTTGGCACTGATACCAGAAACCAC 
AACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGACAGCAGTATGCTGTNTTGCCGAC 



FIGURE 72 



CAGCCCCGCGCGCCGGCCGAGTCGCTGAGCCGCGGCTGCCGGACGGGACGGGACCGGCTAGG 
CTGGGCGCGCCCCCCGGGCCCCGCCGTGGGCATGGGCGCACTGGCCCGGGCGCTGCTGCTGC 
CTCTGCTGGCCCAGTGGCTCCTGCGCGCCGCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTG 
CCCCTCCGGGTGGCCGCGGCCACGAACCGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCC 
TGCCGAGCGCCACGCCGACGGCTTGGCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGG 
GCGCCGCCAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTAC 
CTGGAGATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAG 
TAACTTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGT 
CTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACG 
GGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAA 
CATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATAC 
TTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCC 
CTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCC 
CGTTGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGT 
ATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTG 
AAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGC 
CATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTGG 
AAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCCCAG 
CTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATCTACCT 
GAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTACATTCAGC 
CCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCATCCACAAAT 
GCGCTGGTGATCGGTGCCACGGTGA.TGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAA 
GAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTT 
CCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCTCAGTCTTTGAGCGAG 
CCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGAGCCATCCTCCTTGTCTT 
AATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCA 
ATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATAGCCAGGCCTGACCTCAAGCAA 
CCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGCAGCAGCCGGGATCGATGGTGGCG 
CTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCTGCTCCCAGATGCCTTCTAGATTCAC 
TGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATCCTCCCTACTTCCAAGAAAAATAATTAA 
AAAAAAAACTTCATTCTAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA45493 
xsubunit 1 of 1, 518 aa, 1 stop 
><MW: 56180, pi: 5.08, NX(S/T): 2 

MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGLAL 
ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHS 
YIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENF 
FLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGSGTNGGS 
LVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLR 
LPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFR 
ITILPQLYIQPMMGAGLlSrZECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCA 
EIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRC 
QRRPRDPEWNDESSLVRHRWK 

Important features: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 466-494 

N-glycosylation sites. 

amino acids 170-173 and 366-369 

Leucine zipper pattern. 

amino acids 10-31 and 197-118 

Eukaryotic and viral aspartyl proteases 

amino acids 109-118, 252-261 and 298-310 



CGCCTCCGCCTTCGGAGGCTGACGCGCCCGGGCGCCGTTCCAGGCCTGTGCAGGGCGGATCG 
GCAGCCGCCTGGCGGCGATCCAGGGCGGTGCGGGGCCTGGGCGGGAGCCGGGAGGCGCGGCC 
GGCATGGAGGCGCTGCTGCTGGGCGCGGGGTTGCTGCTGGGCGCTTACGTGCTTGTCTACTA 
CAACCTGGTGAAGGCCCCGCCGTGCGGCGGCATGGGCAACCTGCGGGGCCGCACGGCCGTGG 
TCACGGGCGCCAACAGCGGCATCGGAAAGATGACGGCGCTGGAGCTGGCGCGCCGGGGAGCG 
CGCGTGGTGCTGGCCTGCCGCAGCCAGGAGCGCGGGGAGGCGGCTGCCTTCGACCTC CGC CA 
GGAGAGTGGGAACAATGAGGTCATCTTCATGGCCTTGGACTTGGCCAGTCTGGCCTCGGTGC 
GGGCCTTTGCCACTGCCTTTCTGAGCTCTGAGCCACGGTTGGACATCCTCATCCACAATGCC 
GGTATCAGTTCCTGTGGCCGGACCCGTGAGGCGTTTAACCTGCTGCTTCGGGTGAACCATAT 
CGGTCCCTTTCTGCTGACACATCTGCTGCTGCCTTGCCTGAAGGCATGTGCCCCTAGCCGCG 
TGGTGGTGGTAGCCTCAGCTGCCCACTGTCGGGGACGTCTTGACTTCAAACGCCTGGACCGC 
CCAGTGGTGGGCTGGCGGCAGGAGCTGCGGGCATATGCTGACACTAAGCTGGCTAATGTACT 
GTTTGCCCGGGAGCTCGCCAACCAGCTTGAGGCCACTGGCGTCACCTGCTATGCAGCCCACC 
CAGGGCCTGTGAACTCGGAGCTGTTCCTGCGCCATGTTCCTGGATGGCTGCGCCCACTTTTG 
CGCCCATTGGCTTGGCTGGTGCTCCGGGCACCAAGAGGGGGTGCCCAGACACCCCTGTATTG 
TGCTCTACAAGAGGGCATCGAGCCCCTCAGTGGGAGATATTTTGCCAACTGCCATGTGGAAG 
AGGTGCCTCCAGCTGCCCGAGACGACCGGGCAGCCCATCGGCTATGGGAGGCCAGCAAGAGG 
CTGGCAGGGCTTGGGCCTGGGGAGGATGCTGAACCCGATGAAGACCCCCAGTCTGAGGACTC 
AGAGGCCCCATCTTCTCTAAGCACCCCCCACCCTGAGGAGCCCACAGTTTCTCAACCTTACC 
CCAGCCCTCAGAGCTCACCAGATTTGTCTAAGATGACGCACCGAATTCAGGCTAAAGTTGAG 
CCTGAGATCCAGCTCTCCTAACCCTCAGGCCAGGATGCTTGCCATGGCACTTCATGGTCCTT 
GAAAACCTCGGATGTGTGTGAGGCCATGCCCTGGACACTGACGGGTTTGTGATCTTGACCTC 
CGTGGTTACTTTCTGGGGCCCCAAGCTGTGCCCTGGACATCTCTTTTCCTGGTTGAAGGAAT 
AATGGGTGATTATTTCTTCCTGAGAGTGACAGTAACCCCAGATGGAGAGATAGGGGTATGCT 
AGACACTGTGCTTCTCGGAAATTTGGATGTAGTATTTTCAGGCCCCACCCTTATTGATTCTG 
ATCAGCTCTGGAGCAGAGGCAGGGAGTTTGCAATGTGATGCACTGCCAACATTGAGAATTAG 
TGAACTGATCCCTTTGCAACCGTCTAGCTAGGTAGTTAAATTACCCCCATGTTAATGAAGCG 
GAATTAGGCTCCCGAGCTAAGGGACTCGCCTAGGGTCTCACAGTGAGTAGGAGGAGGGCCTG 
GGATCTGAACCCAAGGGTCTGAGGCCAGGGCCGACTGCCGTAAGATGGGTGCTGAGAAGTGA 
GTCAGGGCAGGGCAGCTGGTATCGAGGTGCCCCATGGGAGTAAGGGGACGCCTTCCGGGCGG 
ATGCAGGGCTGGGGTCATCTGTATCTGAAGCCCCTCGGAATAAAGCGCGTTGACCGCCAAAA 
AAAAAAAAAAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48227 
<subunit 1 of 1, 377 aa, 1 stop 
<MW: 40849, pi: 7.98, NX(S/T): 0 

MEALLLGAGLLLGAYVLWYNLVKAPPCGGMGNLRGRTAVVTGANSGIGKMTALELARRGAR 
WIACRSQERGEAAAFDLRQESGNNSVIFMALDLASLASVRAFATAFLSSEPRLDILIHNAG 
ISSCGRTREAFNLLLRVNHIGPFLLTHLLLPCLKACAPSRWWASAAHCRGRLDFKRLDRP 
WGWRQELRAYADTKLANVLFAREKANQLEATGVTCYAAHPGPVNSELFLRHVPGWLRPLLR 
PLAWLVLRAPRGGAQTPLYCALQEGIEPLSGRYFANCHVEEVPPAARDDRAAHRLWEASKRL 
AGLGPGEDAEPDEDPQSEDSEAPSSLSTPHPEEPTVSQPYPSPQSSPDLSKMTHRIQAKVEP 
EIQLS 

Important features: 
Signal peptide: 

amino acids 1-16 

Glycosaminoglycan attachment site. 

amino acids 46-49 

Short -chain alcohol dehydrogenase family 

amino acids 37-49 and 114-124 



FIGURE 76 

GGAG<3AGACAGCCTCCTGGQGGGCAGGGGTTCCCTGCCTCTGCTGCTCCTGCTCAT CATG GGAGGCATGGCTCAG 
GACTCCCCGCCCCAGATCCTAGTCCACCCCCAGGACCaGCTGTTCCAGGGCCCTGGCCCTGCCAGGATGAGCTG^ 
CAAGCCTCAGGCCAGCCACCTCCCACCATCCGCTGGTTGCTGAATGGGCAGCCCCTGAGCATGGTGCCCCCAGAC 
CCACACCACCTCCTGCCTGATGGGACCCTTCTGCTC 

GCCCTGTCCACAGACCTGGGTGTCTACACATGTGAGGCCAGCAACCGGCTTGGCACGGCAGTCAGCAGAGGCGCT 

CGGCTGTCTGTGGOTGTCCTCCGGGAGGATTTCCAGATCCAGCCTCGGGACATGGTGGCTGTGGTGGGTGAGCAG 

TTTACTCTGGAATGTGGGCCGCCCTGGGGCCACCCAGAGCCCACAGTCTCATGGTGGAAAGATGGGAA 

GCCCTCCAGCCCGGAAGGCACACAGTGTCC(3GGGGGTCCCTGCTGATGGCAAGAGCAGAGAAGAGTGACGAAGGG 

ACOTACATGTGTGTGGCCACCAACAGCGCAGGAC^ 

CAGGACTACACGGAGCCTGTGGAGCTTCTGGCTGTGCGAATTCAGCTGGAAAATGTGACaCTGCTGAACCCGGAT 
CCTGC^GAGGGCCCCAAGCCTAGACCGGCGGTGTGGCTCAGCTGGAAGGTCAGTGGCCCTGGTGCGCCTGCCCS^A 
TCTTACACGGCCTTGTTCAGGACCCAGACTGCCCCGGGAGGCCAGGGAGCTCCGTGGGCAGAGGAGCTGCTGGCC 
GGCTGGCAGAGCGCAGAGCTTGGAGGCCTCCACTGGGGCCAAGACTACGAGTTCAAAGTGAGACCATCCTCTGGC 
CGGGCTCGAGGCCCTGACAGCAACGTGCTGCTCCTGAGGCTGCCGGAAAAAGTGCCCAGTGCCCCACCTCAGGAA 
GTGACTCTAAAGCCTGGQ^TGGCACTGTCTTTGTGAGCTGGGTCCCACCACCTGCTGAAAACCACAATGGCATC 
ATCCGTGGCTACCAGGTCTGGAGCCTGGGCJiACACATCACTGCCACCAGCCTACTGGACTGTAGTTGGTGAGCAG 
ACCC^GCTGGAAATCGCCaCCC^TATGC^ 

GGGGAGCCCAGTAGACCTGTCTGCCTCCTTTOAGAGCAGGCCATGGAGCGAGCCACCCAAGAACCCAGTGAGCAT 
GGTCCCTGGACCCTGGAGCAGCTGAGGGOTACCTTGAAGCGGCCTGAGGTCATTGCC^CCTGCGGTGTTGCACTC 
TGGCTGCTGCTTCTGGGCACCGCCGTGTGTATCCACCGCCGGCGCCGAGCTAGGGTGCACCTGGGCCCAGGTCTG 
TACAGATATACCAGTGAGGATGCCATCCTAJlAACACAGGATGGATCACAGTGACTCCCAGTGGTTGGGAGAC^ 

tggcgttccacctctggctctcgggacctgagcagc^ 

gacccactagactgtcgtcgctccttgctctcctgggactcccgaagccccggcgtgcccctgcttccagacacc 

AGCACTTTTTATGGCTCCCTCATCGCTGAGCT^ 

gt(^ggcgcctcc<^ccccagctggcccagct^ 

GGACTCTCTTCTCCCCGCTTGTCTCTGGCCCCTGCAGAGGCTTGGAAGGCCAAAAAGAAGCAGGAGCTGCAGCAT 

GCC^CAGTTCCCCACTGCTCCGGGGCAGCCACTCCTTGGAGCTCCGGGCCTGTGAGTTAGGAAATAGAGGTTCC 

AAGAACCTTTCCCAAAGCCCAGGAGCTGTGCCCCAAGCTCTGGTTGCCTGGCGGGCCCTGGGACCGAAACTCCTC 

AGCTCCTCAAATGAGCTGGTTACTCGTCATCT^ 

AGTC^^GACCCAGCCTCCGGTGGCACCACAGGCT^ 

CTTAGCCCCTGCAGTCCCCCTAGCCCCCAGGCCTCTTCCCTCTCTGGCCCCAGCCCAGCTTCCAGTCGCCTGTCC 
AGCTCCTCaCTGTCATCCCTGGGGGAGGATCAAGACAGCGTGCTGACCCCTGAGGAGGTAGCCCTGTGCTTGGAA 
CTCAGTGAGGGTGAGGAGACTCCCAGGAACAGCGTC^ 

GGGTACATCAGCGTCCC^CAGCCTCAGAGTTCACGGACATGGGCAGGACTGGAGGAGGGGTGGGGCCC^AGGGG 

GGAGTCTTGCTGTGCCCACCTCGGCCCTGCCTCACCCCCACCCCCAGCGAGGGCTCCTTAGCCAATGGTTGGGGC 

TCAGCCTCTGAGGACAATGCCGCCAGCGCCAGAGCCAGCCTTGTCAGCTCCTCCGATGGOTCCTTCCTCGCTGAT 

GCTCACTTTGCCCGGGCCCTGGCAGTGGCTGTGGATAGCTTTGGTTTCGGTCTAGAGCCCAGGGAGGCAGACTGC 

GTCTTCATAGATGCCTCATC^CCTCCCTCCCCaCGGGATGAGATCTTCCTGACCCCCAACCTCTCCCTGCCCCTG 

TGGGAGTGGAGGCCAGACTGGTTGGAAGACATGGAGGTCAGCCACACCCAGCGGCTGGGAAGGGGGATGCCTCCC 

TGGCCCCCTGACTCTCAGATCTCTTCCCAGAGAAGTCAGCTCCACTGTCGTATGCCCAAGGCTGGTGCTTCTCCT 

GTAGATTACTCCTGAACCGTGTCCCTGAGACTTCCCAGACGGGAATC^GAACCACTTCTCCTGTCCACCCACAAG 

ACCTGGGCTGTGGTGTGTGGGTCTTGGCCTGTGTTTCTCTGCAGCTGGGGTCCACCTTCCCAAGCCTCCAGAGAG 

TTCTCCCTCCACGATTGTGAAAACAAATGAAAACAAAATT^ 

ACATCATCTCC^CCTGACTCCTAGCCACTGCTTTC 

CTGAGGAGCaGCCCTGCCTGCTGCTCTTCCCCC^CCATTTGGATCACAGGAAGTGGAGGAGCCAGAGGTGCCTTT 
GTGGAGGACAGCAGTGGCTGCTGGGAGAGGGCTGTGGAGGAAGGAGCTTCTCGGAGCCCCCTCTCAGCCTTACCT 
GGGCCCCTCCTCTAGAGAAGAGCTCAACTCTCTCCCAACCTCACCaTGGAAAGAAAATAATTATGAATGCCACTG 
AGGCACTGAGGCCCTACCTCATGCCAAACAAAGGGTTCAAGGCTGGGTCTAGCGAGGATGCTGAAGGAAGGGAGG 
TATGAGACCGTAGGTOW^GCACCATCCTCGTACTGTTGTCACTATGAGCTTAAGAAATTTGATACC^TAAAAT 
GGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 77 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA41404 
<subunit 1 of 1, 985 aa, 1 stop 
<MW: 105336, pi: 6.55, NX(S/T): 7 

MGGMAQDSPPQILVHPQDQLFQGPGPARMSCQASGQPPPTIRWLLNGQPLSMVPPDPHHLLP 
DGTLLLLQPPARGHAHDGQALSTDLGVYTCEASNRLGTAVSRGARLSVAVLREDFQIQPRDM 
VAWGEQFTLECGPPWGHPEPTVSWWKDGKPIALQPGRHTVSGGSLLMARAEKSDEGTYMCV 
ATNSAGHRESRAARVSIQEPQDYTEPVELLAVRIQLENVTLLNPDPAEGPKPRPAVWLSWKV 
SGPAAPAQSYTALFRTQTAPGGQGAPWAEELLAGWQSAELGGLHWGQDYEFKVRPSSGRARG 
PDSNVLLLRLPEKVPSAPPQEVTLKPGNGTVFVSWVPPPAENHNGIIRGYQVWSLGNTSLPP 
ANWTWGEQTQLEIATHMPGSYCVQVAAVTGAGAGEPSRPVCLLLEQAMERATQEPSEHGPW 
TLEQLRATLKRPEVIATCGVALWLLLLGTAVCIHRRRRARVHLGPGLYRYTSEDAILKHRMD 
HSDSQWLADTWRSTSGSRDLSSSSSLSSRLGADARDPLDCRRSLLSWDSRSPGVPLLPDTST 
FYGSLIAELPSSTPARPSPQVPAVRRLPPQLAQLSSPCSSSDSLCSRRGLSSPRLSLAPAEA 
WKAKKKQELQHANSSPLLRGSHSLELRACELGNRGSKNLSQSPGAVPQALVAWRALGPKLLS 
SSNELVTRHLPPAPLFPHETPPTQSQQTQPPVAPQAPS S ILLPAAPI PI LS PCS PPSPQAS S 
LSGPSPASSRLSSSSLSSLGEDQDSVLTPEEVALCLELSEGEETPRNSVSPMPRAPSPPTTY 
GYISVPTASEFTDMGRTGGGVGPKGGVLLCPPRPCLTPTPSEGSLANGWGSASEDNAASARA 
SLVSSSDGSFLADAHFARALAVAVDSFGFGLEPREADCVFIDASSPPSPRDEIFLTPNLSLP 
LWEWRPDWLEDMEVSHTQRLGRGMPPWPPDSQISSQRSQLHCRMPKAGASPVDYS 

Important features: 
Transmembrane domain: 
amino acids 448-467 
N-glycosylation sites: 

amino acids 224-227, 338-341, 367-370, 374-377, 658-661 and 926- 
929 

N-myristoylation sites. 

amino acids 47-52, 80-85, 88-93, 99-104, 105-110, 181-186, 272- 
277, 290-295, 355-360, 403-408, 462-467, 561-566, 652-657, 849- 
854 and 876-881 

Phospho tyrosine interaction domain proteins 

amino acids 740-753 



FIGURE 78 

CTCCCACGGTGTCCAGCGCCCAGAATGCGGCTTCTGGTCCTGCTATGGGGTTGCCTGCTGCT 
CCCAGGTTATGAAGCCCTGGAGGGCCCAGAGGAAATCAGCGGGTTCGAAGGGGACACTGTGT 
CCCTGCAGTGCACCTACAGGGAAGAGCTGAGGGACCACCGGAAGTACTGGTGCAGGAAGGGT 
GGGATCCTCTTCTCTCGCTGCTCTGGCACCATCTATGCAGAAGAAGAAGGCCAGGAGACAAT 
GAAGGGCAGGGTGTCCATCCGTGACAGCCGCCAGGAGCTCTCGCTCATTGTGACCCTGTGGA 
ACCTCACCCTGCAAGACGCTGGGGAGTACTGGTGTGGGGTCGAAAAACGGGGCCCCGATGAG 
TCTTTACTGATCTCTCTGTTCGTCTTTCCAGGACCCTGCTGTCCTCCCTCCCCTTCTCCCAC 
CTTCCAGCCTCTGGCTACAACACGCCTGCAGCCCAAGGCAAAAGCTCAGCAAACCCAGCCCC 
CAGGATTGACTTCTCCTGGGCTCTACCCGGCAGCCACCACAGCCAAGCAGGGGAAGACAGGG 
GCTGAGGCCCCTCCATTGCCAGGGACTTCCCAGTACGGGCACGAAAGGACTTCTCAGTACAC 
AGGAACCTCTCCTCACCCAGCGACCTCTCCTCCTGCAGGGAGCTCCCGCCCCCCCATGCAGC 
TGGACTCCACCTCAGCAGAGGACACCAGTCCAGCTCTCAGCAGTGGCAGCTCTAAGCCCAGG 
GTGTCCATCCCGATGGTCCGCATACTGGCCCCAGTCCTGGTGCTGCTGAGCCTTCTGTCAGC 
CGCAGGCCTGATCGCCTTCTGCAGCCACCTGCTCCTGTGGAGAAAGGAAGCTCAACAGGCCA 
CGGAGACACAGAGGAACGAGAAGTTCTGGCTCTCACGCTTGACTGCGGAGGAAAAGGAAGCC 
CCTTCCCAGGCCCCTGAGGGGGACGTGATCTCGATGCCTCCCCTCCACACATCTGAGGAGGA 
GCTGGGCTTCTCGAAGTTTGTCTCAGCGTAGGGCAGGAGGCCCTCCTGGCCAGGCCAGCAGT 
GAAGCAGTATGGCTGGCTGGATCAGCACCGATTCCCGAAAGCTTTCCACCTCAGCCTCAGAG 
TCCAGCTGCCCGGACTCCAGGGCTCTCCCCACCCTCCCCAGGCTCTCCTCTTGCATGTTCCA 
GCCTGACCTAGAAGCGTTTGTCAGCCCTGGAGCCCAGAGCGGTGGCCTTGCTCTTCCGGCTG 
GAGACTGGGACATCCCTGATAGGTTCACATCCCTGGGCAGAGTACCAGGCTGCTGACCCTCA 
GCAGGGCCAGACAAGGCTCAGTGGATCTGGTCTGAGTTTCAATCTGCCAGGAACTCCTGGGC 
CTCATGCCCAGTGTCGGACCCTGCCTTCCTCCCACTCCAGACCCCACCTTGTCTTCCCTCCC 
TGGCGTCCTCAGACTTAGTCCCACGGTCTCCTGCATCAGCTGGTGATGAAGAGGAGCATGCT 
GGGGTGAGACTGGGATTCTGGCTTCTCTTTGAACCACCTGCATCCAGCCCTTCAGGAAGCCT 
GTGAAAAACGTGATTCCTGGCCCCACCAAGACCCACCAAAACCATCTCTGGGCTTGGTGCAG 
GACTCTGAATTCTAACAATGCCCAGTGACTGTCGCACTTGAGTTTGAGGGCCAGTGGGCCTG 
ATGAACGCTCACACCCCTTCAGCTTAGAGTCTGCATTTGGGCTGTGACGTCTCCACCTGCCC 
CAATAGATCTGCTCTGTCTGCGACACCAGATCCACGTGGGGACTCCCCTGAGGCCTGCTAAG 
TCCAGGCCTTGGTCAGGTCAGGTGCACATTGCAGGATAAGCCCAGGACCGGCACAGAAGTGG 
TTGCCTTTNCCATTTGCCCTCCCTGGNCCATGCCTTCTTGCCTTTGGAAAAAATGATGAAGA 
AAACCTTGGCTCCTTCCTTGTCTGGAAAGGGTTACTTGCCTATGGGTTCTGGTGGCTAGAGA 
GAAAAGTAGAAAACCAGAGTGCACGTAGGTGTCTAACACAGAGGAGAGTAGGAACAGGGCGG 
ATACCTGAAGGTGACTCCGAGTCCAGCCCCCTGGAGAAGGGGTCGGGGGTGGTGGTAAAGTA 
GCACAACTACTATTTTTTTTCTTTTTCCATTATTATTGTTTTTTAAGACAGAATCTCGTGCT 
GCTGCCCAGGCTGGAGTGCAGTGGCACGATCTGCAAACTCCGCCTCCTGGGTTCAAGTGATT 
CTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCACCACACCTGGCTAATT 
TTTGTACTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGAC 
CTCAAATGAGCCTCCTGCTTCAGTCTCCCAAATTGCCGGGATTACAGGCATGAGCCACTGTG 
TCTGGCCCTATTTCCTTTAAAAAGTGAAATTAAGAGTTGTTCAGTATGCAAAACTTGGAAAG 
ATGGAGGAGAAAAAGAAAAGGAAGAAAAAAATGTCACCCATAGTCTCACCAGAGACTATCAT 
TATTTCGTTTTGTTGTACTTCCTTCCACTCTTTTCTTCTTCACATAATTTGCCGGTGTTCTT 
TTTACAGAGCAATTATCTTGTATATACAACTTTGTATCCTGCCTTTTCCACCTTATCGTTCC 
ATCACTTTATTCCAGCACTTCTCTGTGTTTTACAGACCTTTTTATAAATAAAATGTTCATCA 
GCTGCATAAAAAAAAAAAAAA 



FIGURE 79 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA44196 
<subunit 1 of 1, 332 aa, 1 stop 
<MW: 36143, pi: 5.89, NX{S/T) : 1 

MRLLVLLWGCLLLPGYEALEGPEEISGFEGDTVSLQCTYREELRDHRKYWCRKGGILFSRCS 
GTIYAEEEGQETMKGRVSIRDSRQELSLIVTLWNLTLQDAGEYWCGVEKRGPDESLLISLFV 
FPGPCCPPSPSPTFQPLATTRLQPKAKAQQTQPPGLTSPGLYPAATTAKQGKTGAEAPPLPG 
TSQYGHERTSQYTGTSPHPATSPPAGSSRPPMQLDSTSAEDTSPALSSGSSKPRVSIPMVRI 
LAPVLVLLSLLSAAGLIAFCSHLLLWRKEAQQATETQRNEKFWLSRLTAEEKEAPSQAPEGD 
VISMPPLHTSEEELGFSKFVSA 

Important features: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 248-269 

N-glycosylation site. 

amino acids 96-99 

Fibrinogen beta and gamma chains C- terminal domain. 

amino acids 104-113 



Ig like V-type domain: 

amino acids 13-128 



FIGURE 80 

TTGTGACTAAAAGCTGGCCTAGCAGGCCAGGGAGTGCAGCTGCAGGCGTGGGGGTGGCAGGA 
GCCGCAGAGCCAGAGCAGACAGCCGAGAAACAGGTGGACAGTGTGAAAGAACCAGTGGTCTC 
GCTCTGTTGCCCAGGCTAGAGTGTACTGGCGTGATCATAGCTCACTGCAGCCTCAGACTCCT 
GGACTTGAGAAATCCTCCTGCCTTAGCCTCCTGCATATCTGGGACTCCAGGGGTGCACTCAA 
GCCCTGTTTCTTCTCCTTCTGTGAGTGGACCACGGAGGCTGGTGAGCTGCCTGTCATCCCAA 
AGCTCAGCTCTGAGCCAGAGTGGTGGTGGCTCCACCTCTGCCGCCGGCATAGAAGCCAGGAG 
CAGGGCTCTCAGAAGGCGGTGGTGCCCAGCTGGGATCATGTTGTTGGCCCTGGTCTGTCTGC 
TCAGCTGCCTGCTACCCTCCAGTGAGGCCAAGCTCTACGGTCGTTGTGAACTGGCCAGAGTG 
CTACATGACTTCGGGCTGGACGGATACCGGGGATACAGCCTGGCTGACTGGGTCTGCCTTGC 
TTATTTCACAAGCGGTTTCAACGCAGCTGCTTTGGACTACGAGGCTGATGGGAGCACCAACA 
ACGGGATCTTCCAGATCAACAGCCGGAGGTGGTGCAGCAACCTCACCCCGAACGTCCCCAAC 
GTGTGCCGGATGTACTGCTCAGATTTGTTGAATCCTAATCTCAAGGATACCGTTATCTGTGC 
CATGAAGATAACCCAAGAGCCTCAGGGTCTGGGTTACTGGGAGGCCTGGAGGCATCACTGCC 
AGGGAAAAGACCTCACTGAATGGGTGGATGGCTGTGACTTCTAGGATGGACGGAACCATGCA 
CAGCAGGCTGGGAAATGTGGTTTGGTTCCTGACCTAGGCTTGGGAAGACAAGCCAGCGAATA 
AAGGATGGTTGAACGTGAAA 



FIGURE 81 



</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA52187 
<subunit 1 of 1, 146 aa, 1 stop 
<MW: 16430, pi: 5.05, NX(S/T) : 1 

MLLALVCLLS CLLPS SEAKLYGRCELARVLHDFGLDGYRGYSLADWVCLAYFTSGFNAAALD 
YEADGSTNNGIFQINSRRWCSNLTPNVPNVCRMYCSDLLNPNLKDTVICAMKITQEPQGLGY 
WEAWRHHCQGKDLTEWVDGCDF 

Important features: 
Signal peptides 

amino acids 1-18 

N-myristoylation site. 

amino acids 67-72 

Homolgous region to Alpha- lactalbumin / lysozyme C proteins. 

amino acids 34-58 (catalytic domain) , 111-132 and 66-107 



FIGURE 82 



AGCCGCTGCCCCQGGCCGGGCGCCCGCGGCGGCACC ATGA GTCCCCGCTCGTGCCTGCGTTC 
GCTGCGCCTCCTCGTCTTCGCCGTCTTCTCAGCCGCCGCGAGCAACTGGCTGTACCTGGCCA 
AGCTGTCGTCGGTGGGGAGCATCTCAGAGGAGGAGACGTGCGAGAAACTCAAGGGCCTGATC 
CAGAGGCAGGTGCAGATGTGCAAGCGGAACCTGGAAGTCATGGACTCGGTGCGCCGCGGTGC 
CCAGCTGGCCATTGAGGAGTGCCAGTACCAGTTCCGGAACCGGCGCTGGAACTGCTCCACAC 
TCGACTCCTTGCCCGTCTTCGGCAAGGTGGTGACGCAAGGGACTCGGGAGGCGGCCTTCGTG 
TACGCCATCTCTTCGGCAGGTGTGGCCTTTGCAGTGACGCGGGCGTGCAGCAGTGGGGAGCT 
GGAGAAGTGCGGCTGTGACAGGACAGTGCATGGGGTCAGCCCACAGGGCTTCCAGTGGTCAG 
GATGCTCTGACAACATCGCCTACGGTGTGGCCTTCTCACAGTCGTTTGTGGATGTGCGGGAG 
AGAAGCAAGGGGGCCTCGTCCAGCAGAGCCCTCATGAACCTCCACAACAATGAGGCCGGCAG 
GAAGGCCATCCTGACACACATGCGGGTGGAATGCAAGTGCCACGGGGTGTCAGGCTCCTGTG 
AGGTAAAGACGTGCTGGCGAGCCGTGCCGCCCTTCCGCCAGGTGGGTCACGCACTGAAGGAG 
AAGTTTGATGGTGCCACTGAGGTGGAGCCACGCCGCGTGGGCTCCTCCAGGGCACTGGTACC 
ACGCAACGCACAGTTCAAGCCGCACACAGATGAGGACCTGGTGTACTTGGAGCCTAGCCCCG 
ACTTCTGTGAGCAGGACATGCGCAGCGGCGTGCTGGGCACGAGGGGCCGCACATGCAACAAG 
ACGTCCAAGGCCATCGACGGCTGTGAGCTGCTGTGCTGTGGCCGCGGCTTCCACACGGCGCA 
GGTGGAGCTGGCTGAACGCTGCAGCTGCAAATTCCACTGGTGCTGCTTCGTCAAGTGCCGGC 
AGTGCCAGCGGCTCGTGGAGTTGCACACGTGCCGATGACCGCCTGCCTAGCCCTGCGCCGGC 
AACCACCTAGTGGCCCAGGGAAGGCCGATAATTTAAACAGTCTCCCACCACCTACCCCAAGA 
GATACTGGTTGTATTTTTTGTTCTGGTTTGGTTTTTGGGTCCTCATGTTATTTATTGCCGAA 
ACCAGGCAGGCAACCCCAAGGGCACCAACCAGGGCCTCCCCAAAGCCTGGGCCTTTGTGGCT 
GCCACTGACCAAAGGGACCTTGCTCGTGCCGCTGGCTGCCCGCATGTGGCTGCCACTGACCA 
CTCAGTTGTTATCTGTGTCCGTTTTTCTACTTGCAGACCTAAGGTGGAGTAACAAGGAGTAT 
TACCACCACATGGCTACTGACCGTGTCATCGGGGAAGAGGGGGCCTTATGGCAGGGAAAATA 
GGTACCGACTTGATGGAAGTCAGACCCTCTGGAAAAAAGAACTCTTAACTCTCCAGCACACA 
TACACATGGACTCCTGGCAGCTTGAGCCTAGAAGCCATGTCTCTCAAATGCCCTGAGAAAGG 
GAACAAGCAGATACCAGGTCAAGGGCACCAGGTTCATTTCAGCCCTTACATGGACAGCTAGA 
GGTTCGATATCTGTGGGTCCTTCCAGGCAAGAAGAGGGAGATGAGAGCAAGAGACGACTGAA 
GTCCCACCCTAGAACCCAGCCTGCCCCAGCCTGCCCCTGGGAAGAGGAAACTTAACCACTCC 
CCAGACCCACCTAGGCAGGCATATAGGCTGCCATCCTGGACCAGGGATCCCGGCTGTGCCTT 
TGCAGTCATGCCCGAGTCACCTTTCACAGCGCTGTTCCTCCATGAAACTGAAAAACACACAC 
ACACACACACACACACACACACACACACACACACACACGGACACACACACACACCTGCGAGA 
GAGAGGGAGGAAAGGGCTGTGCCTTTGCAGTCATGCCCGAGTCACCTTTCACAGCACTGTTCCTC 



FIGURE 83 

< /us r / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA4 8328 
<subunit 1 of 1, 351 aa, 1 stop 
<MW: 39052, pi: 8.97, NX(S/T): 2 

MSPRSCLRSLRLLVFAVFSAAASNWLYIAKLSSVGSISEEETCEKLKGLIQRQVQMCKRNLE 
VMDSVRRGAQLAIEECQYQFRNRRWNCSTLDSLPVFGKWTQGTREAAFVYAISSAGVAFAV 
TRACSSGELEKCGCDRTVHGVSPQGFQWSGCSDNIAYGVAFSQSFVDVRERSKGASSSRALM 
NLHNNEAGRKAILTHMRVECKCHGVSGSCEVKTCWRAVPPFRQVGHALKEKFDGATEVEPRR 
VGSSRALVPRNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGTRGRTCNKTSKAIDGCELLC 
CGRGFHTAQVELAERCSCKFHWCCFVKCRQCQRLVELHTCR 

Important features: 
Signal peptide: 

amino acids 1-22 

N-glycosylation sites. 

amino acids 88-91 and 297-300 

Wnt-1 family signature. 

amino acids 206-215 

Homologous region to Wnt-1 family proteins 

amino acids 183-235, 305-350, 97-138, 53-92 and 150 -174 



FIGURE 84 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGGTGCCTGCAT 
CGCCATGGACACCACCAGGTACAGCAAGTGGGGCGGCAGCTCCGAGGAGGTCCCCGGAGGGC 
CCTGGGGACGCTGGGTGCACTGGAGCAGGAGACCCCTCTTCTTGGCCCTGGCTGTCCTGGTC 
ACCACAGTCCTTTGGGCTGTGATTCTGAGTATCCTATTGTCCAAGGCCTCCACGGAGCGCGC 
GGCGCTGCTTGACGGCCACGACCTGCTGAGGACAAACGCCTCGAAGCAGACGGCGGCGCTGG 
GTGCCCTGAAGGAGGAGGTCGGAGACTGCCACAGCTGCTGCTCGGGGACGCAGGCGCAGCTG 
CAGACCACGCGCGCGGAGCTTGGGGAGGCGCAGGCGAAGCTGATGGAGCAGGAGAGCGCCCT 
GCGGGAACTGCGTGAGCGCGTGACCCAGGGCTTGGCTGAAGCCGGCAGGGGCCGTGAGGACG 
TCCGCACTGAGCTGTTCCGGGCGCTGGAGGCCGTGAGGCTCCAGAACAACTCCTGCGAGCCG 
TGCCCCACGTCGTGGCTGTCCTTCGAGGGCTCCTGCTACTTTTTCTCTGTGCCAAAGACGAC 
GTGGGCGGCGGCGCAGGATCACTGCGCAGATGCCAGCGCGCACCTGGTGATCGTTGGGGGCC 
TGGATGAGCAGGGCTTCCTCACTCGGAACACGCGTGGCCGTGGTTACTGGCTGGGCCTGAGG 
GCTGTGCGCCATCTGGGCAAGGTTCAGGGCTACCAGTGGGTGGACGGAGTCTCTCTCAGCTT 
CAGCCACTGGAACCAGGGAGAGCCCAATGACGCTTGGGGGCGCGAGAACTGTGTCATGATGC 
TGCACACGGGGCTGTGGAACGACGCACCGTGTGACAGCGAGAAGGACGGCTGGATCTGTGAG 
AAAAGGCACIAACTGCTGACCCCGCCC1AGTGCCCTGGAGCCGCGCCCATTGCAGCATGTCGTA 
TCCTGGGGGCTGCTCACCTCCCTGGCTCCTGGAGCTGATTGCCAAAGAGTTTTTTTCTTCCT 
CATCCACCGCTGCTGAGTCTCAGAAACACTTGGCCCAACATAGCCCTGTCCAGCCCAGTGCC 
TGGGCTCTGGGACCTCCATGCCGACCTCATCCTAACTCCACTCACGCAGACCCAACCTAACC 
TCCACTAGCTCCAAAATCCCTGCTCCTGCGTCCCCGTGATATGCCTCCACTTCTCTCCCTAA 
CCAAGGTTAGGTGACTGAGGACTGGAGCTGTTTGGTTTTCTCGCATTTTCCACCAAACTGGA 
AGCTGTTTTTGCAGCCTGAGGAAGCATCAATAAATATTTGAGAAATGAAAAAA 



FIGURE 85 



</usr/ seq<3b2/sst/DNA/Dnaseqs .min/ss .DNA56352 
<subunit 1 of 1, 293 aa, 1 stop 
<MW: 32562, pi: 6.53, NX(S/T): 2 

MDTTRYSKWGGSSEEVPGGPWGRWVHWSRRPLFLALAVLVTTVLWAVILSILLSKASTERA^ 
LLDGHDLLRTNASKQTAALGALKEEVGDCHSCCSGTQAQLQTTRAELGEAQAKLMEQESALR 
ELRERVTQGLAEAGRGREDVRTELFRALEAVRLQNNSCEPCPTSWLSFEGSCYFFSVPKTTW 
AAAQDHCADASAHLVIVGGLDEQGFLTRNTRGRGYWLGLRAVRHLGKVQGYQWVDGVSLSFS 
HWNQGEPNDAWGRENCVMMLHTGLWNDAPCDSEKDGWICEKRHNC 

Important features: 

Type II transmembrane domain: 

amino acids 31-54 

N-glycosylation sites. 

amino acids 73-76 and 159-162 

Leucine zipper pattern. 

amino acids 102-123 

N-myristoylation sites. 

amino acids 18-23, 133-138 and 242-247 

C-type lectin domain signature. 

amino acids 264-287 



FIGURE 86 



GCCAGGGGAAGAGGGTGATCCGACCCGGGGAAGGTCGCTGGGCAGGGCGAGTTGGGAAAGCG 
GCAGCCCCCGCCGCCCCCGCAGCCCCTTCTCCTCCTTTCTCCCACGTCCTATCTGCCTCTCG 
CTGGAGGCCAGGCCGTGCAGCATCGAAGACAGGAGGAACTGGAGCCTCATTGGCCGGCCCGG 
GGCGCCGGCCTCGGGCTTAAATAGGAGCTCCGGGCTCTGGCTGGGACCCGACCGCTGCCGGC 
CGCGCTCCCGCTGCTCCTGCCGGGTGATGGAAAACCCCAGCCCGGCCGCCGCCCTGGGCAAG 
GCCCTCTGCGCTCTCCTCCTGGCCACTCTCGGCGCCGCCGGCCAGCCTCTTGGGGGAGAGTC 
CATCTGTTCCGCCAGAGCCCCGGCCAAATACAGCATCACCTTCACGGGCAAGTGGAGCCAGA 
CGGCCTTCCCCAAGCAGTACCCCCTGTTCCGCCCCCCTGCGCAGTGGTCTTCGCTGCTGGGG 
GCCGCGCATAGCTCCGACTACAGCATGTGGAGGAAGAACCAGTACGTCAGTAACGGGCTGCG 
CGACTTTGCGGAGCGCGGCGAGGCCTGGGCGCTGATGAAGGAGATCGAGGCGGCGGGGGAGG 
CGCTGCAGAGCGTGCACGAGGTGTTTTCGGCGCCCGCCGTCCCCAGCGGCACCGGGCAGACG 
TCGGCGGAGCTGGAGGTGCAGCGCAGGCACTCGCTGGTCTCGTTTGTGGTGCGCATCGTGCC 
CAGCCCCGACTGGTTCGTGGGCGTGGACAGCCTGGACCTGTGCGACGGGGACCGTTGGCGGG 
AACAGGCGGCGCTGGACCTGTACCCCTACGACGCCGGGACGGACAGCGGCTTCACCTTCTCC 
TCCCCCAACTTCGCCACCATCCCGCAGGACACGGTGACCGAGATAACGTCCTCCTCTCCCAG 
CCACCCGGCCAACTCCTTCTACTACCCGCGGCTGAAGGCCCTGCCTCCCATCGCCAGGGTGA 
CACTGCTGCGGCTGCGACAGAGCCCCAGGGCCTTCATCCCTCCCGCCCCAGTCCTGCCCAGC 
AGGGACAATGAGATTGTAGACAGCGCCTCAGTTCCAGAAACGCCGCTGGACTGCGAGGTCTC 
CCTGTGGTCGTCCTGGGGACTGTGCGGAGGCCACTGTGGGAGGCTCGGGACCAAGAGCAGGA 
CTCGCTACGTCCGGGTCCAGCCCGC(ZAACAACGGGAGCCCCTGCCCCGAGCTCGAAGAAGAG 
GCTGAGTGCGTCCCTGATAACTGCGTCTAAGACCAGAGCCCCGCAGCCCCTGGGGCCCCCCG 
GAGCCATGGGGTGTCGGGGGCTCCTGTGCAGGCTCATGCTGCAGGCGGCCGAGGGCACAGGG 
GGTTTCGCGCTGCTCCTGACCGCGGTGAGGCCGCGCCGACCATCTCTGCACTGAAGGGCCCT 
CTGGTGGCCGGCACGGGCATTGGGAAACAGCCTCCTCCTTTCCCAACCTTGCTTCTTAGGGG 
CCCCCGTGTCCCGTCTGCTCTCAGCCTCCTCCTCCTGCAGGATAAAGTCATCCCCAAGGCTC 
CAGCTACTCTAAATTATGTCTCCTTATAAGTTATTGCTGCTCCAGGAGATTGTCCTTCATCG 
TCCAGGGGCCTGGCTCCCACGTGGTTGCAGATACCTCAGACCTGGTGCTCTAGGCTGTGCTG 
AGCCCACTCTCCCGAGGGCGCATCCAAGCGGGGGCCACTTGAGAAGTGAATAAATGGGGCGG 
TTTCGGAAGCGTCAGTGTTTCCATGTTATGGATCTCTCTGCGTTTGAATAAAGACTATCTCT 
GTTGCTCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 87 



>< / usr / seqdb2 / s s t /DNA/Dnasegs . min/ s s . DNA53 9 7 1 
xsubunit 1 of 1, 331 aa, 1 stop 
><MW: 35844, pi: 5.45, NX(S/T): 2 

MENPS PAAALGKALCALLLATLGAAGQPLGGES I CS ARAPAKYS ITFTGKWSQTAFPKQYPL 
FRPPAQWSSLLGAAHSSDYSMWRKNQYVSNGLRDFAERGEAWALMKEIEAAGEALQSVHEVF 
SAPAVPSGTGQTSAELEVQRRHSLVSFWRIVPSPDWFVGVDSLDLCDGDRWREQAALDLYP 
YDAGTDSGFTFSSPNFATIPQDTVTEITSSSPSHPANSFYYPRLKALPPIARVTLLRLRQSP 
RAFIPPAPVLPSRDNEIVDSASVPETPLDCEVSLWSSWGLCGGHCGRLGTKSRTRYVRVQPA 
NNGSPCPELEEEAECVPDNCV 

Important features: 
Signal peptide: 

amino acids 1-26 



mm 



FIGURE 8 



GGCGGCGTCCGTGAGGGGCTCCTTTGGGCAGGGGTAGTGTTTGGTGTCCCTGTCTTGCGTGA 
TATTGACAAACTGAAGCTTTCCTGCACCACTGGACTTAAGGAAGAGTGTACTCGTAGGCGGA 
CAGCTTTAGTGGCCGGCCGGCCGCTCTCATCCCCCGTAAGGAGCAGAGTCCTTTGTACTGAC 
CAAGATGAGCAACATCTACATCCAGGAGCCTCCCACGAATGGGAAGGTTTTATTGAAAACTA 
CAGCTGGAGATATTGACATAGAGTTGTGGTCCAAAGAAGCTCCTAAAGCTTGCAGAAATTTT 
ATCCAACTTTGTTTGGAAGCTTATTATGACAATACCATTTTTCATAGAGTTGTGCCTGGTTT 
CATAGTCCAAGGCGGAGATCCTACTGGCACAGGGAGTGGTGGAGAGTCTATCTATGGAGCGC 
CATTCAAAGATGAATTTCATTCACGGTTGCGTTTTAATCGGAGAGGACTGGTTGCCATGGCA 
AATGCTGGTTCTCATGATAATGGCAGCCAGTTTTTCTTCACACTGGGTCGAGCAGATGAACT 
TAACAATAAGCATACCATCTTTGGAAAGGTTACAGGGGATACAGTATATAACATGTTGCGAC 
TGTCAGAAGTAGACATTGATGATGACGAAAGACCACATAATCCACACAAAATAAAAAGCTGT 
GAGGTTTTGTTTAATCCTTTTGATGACATCATTCCAAGGGAAATTAAAAGGCTGAAAAAAGA 
GAAACCAGAGGAGGAAGTAAAGAAATTGAAACCCAAAGGCACAAAAAATTTTAGTTTACTTT 
CATTTGGAGAGGAAGCTGAGGAAGAAGAGGAGGAAGTAAATCGAGTTAGTCAGAGCATGAAG 
GGCAAAAGCAAAAGTAGTCATGACTTGCTTAAGGATGATCCACATCTCAGTTCTGTTCCAGT 
TGTAGAAAGTGAAAAAGGTGATGCACCAGATTTAGTTGATGATGGAGAAGATGAAAGTGCAG 
AGCATGATGAATATATTGATGGTGATGAAAAGAACCTGATGAGAGAAAGAATTGCCAAAAAA 
TTAAAAAAGGACACAAGTGCGAATGTTAAATCAGCTGGAGAAGGAGAAGTGGAGAAGAAATC 
AGTCAGCCGCAGTGAAGAGCTCAGAAAAGAAGCAAGACAATTAAAACGGGAACTCTTAGCAG 
CAAAACAAAAAAAAGTAGAAAATGCAGCAAAACAAGCAGAAAAAAGAAGTGAAGAGGAAGAA 
GCCCCTCCAGATGGTGCTGTTGCCGAATACAGAAGAGAAAAGCAAAAGTATGAAGCTTTGAG 
GAAGCAACAGTCAAAGAAGGGAACTTCCCGGGAAGATCAGACCCTTGCACTGCTGAACCAGT 
TTAAATCTAAACTCACTCAAGCAATTGCTGAAACACCTGAAAATGACATTCCTGAAACAGAA 
GTAGAAGATGATGAAGGATGGATGTCACATGTACTTCAGTTTGAGGATAAAAGCAGAAAAGT 
GAAAGATGCAAGCATGCAAGACTCAGATACATTTGAAATCTATGATCCTCGGAATCCAGTGA 
ATAAAAGAAGGAGGGAAGAAAGCAAAAAGCTGATGAGAGAGAAAAAAGAAAGAAGATAAAAT 
GAGAATAATGATAACCAGAACTTGCTGGAAATGTGCCTACAATGGCCTTGTAACAGCCATTG 
TTCCCAACAGCATCACTTAGGGGTGTGAAAAGAAGTATTTTTGAACCTGTTGTCTGGTTTTG 
AAAAACAATTATCTTGTTTTGCAAATTGTGGAATGATGTAAGCAAATGCTTTTGGTTACTGG 
TACATGTGTTTTTTCCTAGCTGACCTTTTATATTGCTAAATCTGAAATAAAATAACTTTCCT 
TCCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 89 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50919 
xsubunit 1 of 1, 472 aa, 1 stop 
><MW: 53847, pi; 5.75, NX<S/T): 2 

MSNI YI QE PPTNGKVLLKTTAGD ID I ELWS KEAPKACRNF IQLCLEAYYDNT I FHRWPGF I 
VQGGDPTGTGSGGESIYGAPFKDEFHSRLRFNRRGLVAMANAGSHDNGSQFFFTLGRADELN 
NKHTI FGKVTGDTVYNMLRLS EVDIDDDERPHNPHKI KSCEVLFNPFDD 1 1 PRE IKRLKKEK 
PEEEVKKLKPKGTKNFSLLSFGEEAEEEEEEVNRVSQSMKGKSKSSHDLLKDDPHLSSVPW 
ESEKGDAPDLVDDGEDESAEHDEYIDGDEKNLMRERIAKKLKKDTSAIWKSAGEGEVEKKSV 
SRSEELRKEARQLKRELLAAKQKKVENAAKQAEKRSEEEEAPPDGAVAEYRREKQKYEALRK 
QQSKKGTSREDQTIALIJSrQFKSKLTQAIAETPENDIPETEVEDDEGWMSHVLQFEDKSRKVK 
DASMQDSDTFE I YDPRNPVNKRRREESKKLMREKKERR 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 109-112 and 201-204 

Cyclophilin-type peptidyl -prolyl cis- trans isomerase signature. 

amino acids 49-66 

Homologous region to Cyclophilin-type peptidyl -prolyl cis -trans 
isomerase 

amino acids 96-140, 49-89 and 22-51 



FIGURE 90 

CGCCGCCGTTGGGGCTGGAAGTTCCCGCCAGGTCCGTGCCGGGCGAGAGAGATGCTGCCCGG 
CCCGCCTCGGCTTTGAGGCGAGAGAAGTGTCCCAGACCCATTTCGCCTTGCTGACGGCGTCG 
AGCCCTGGCCAGACATGTCCACAGGGTTCTCCTTCGGGTCCGGGACTCTGGGCTCCACCACC 
GTGGCCGCCGGCGGGACCAGCACAGGCGGCGTTTTCTCCTTCGGAACGGGAACGTCTAGCAA 
CCCTTCTGTGGGGCTCAATTTTGGAAATCTTGGAAGTACTTCAACTCCAGCAACTACATCTG 
CTCCTTCAAGTGGTTTTGGAACCGGGCTCTTTGGATCTAAACCTGCCACTGGGTTCACTCTA 
GGAGGAACAAATACAGGTGCCTTGCACACCAAGAGGCCTCAAGTGGTCACCAAATATGGAAC 
CCTGCAAGGAAAACAGATGCATGTGGGGAAGACACCCATCCAAGTCTTTTTAGGAGTCCCCT 
TCTCCAGACCTCCTCTAGGTATCCTCAGGTTTGCACCTCCAGAACCCCCGGAGCCCTGGAAA 
GGAATCAGAGATGCTACCACCTACCCGCCTGGATGGAGTCTCGCTCTGTCGCCAGGCTGGAG 
TGCAGTGGCACGATCTCGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGCGAGTCTCCTGC 
CTCAGCCTCTGAGTGTCTGGGGCTACAGGTGCCTGCAGGAGTCCTGGGGCCAGCTGGCCTCG 
ATGTACGTCAGCACGCGGGAACGGTACAAGTGGCTGCGCTTCAGCGAGGACTGTCTGTACCT 
GAACGTGTACGCGCCGGCGCGCGCGCCCGGGGATCCCCAGCTGCCAGTGATGGTCTGGTTCC 
CGGGAGGCGCCTTCATCGTGGGCGCTGCTTCTTCGTACGAGGGCTCTGACTTGGCCGCCCGC 
GAGAAAGTGGTGCTGGTGTTTCTGCAGCACAGGCTCGGCATCTTCGGCTTCCTGAGCACGGA 
CGACAGCCACGCGCGCGGGAACTGGGGGCTGCTGGACCAGATGGCGGCTCTGCGCTGGGTGC 
AGGAGAACATCGCAGCCTTCGGGGGAGACCCAGGAAATGTGACCCTGTTCGGCCAGTCGGCG 
GGGGCCATGAGCATCTCAGGACTGATGATGTCACCCCTAGCCTCGGGTCTCTTCCATCGGGC 
CATTTCCCAGAGTGGCACCGCGTTATTCAGACTTTTCATCACTAGTAACCCACTGAAAGTGG 
CCAAGAAGGTTGCCCACCTGGCTGGATGCAACCACAACAGCACACAGATCCTGGTAAACTGC 
CTGAGGGCACTATCAGGGACCAAGGTGATGCGTGTGTCCAACAAGATGAGATTCCTCCAACT 
GAACTTCCAGAGAGACCCGGAAGAGATTATCTGGTCCATGAGCCCTGTGGTGGATGGTGTGG 
TGATCCCAGATGACCCTTTGGTGCTCCTGACCCAGGGGAAGGTTTCATCTGTGCCCTACCTT 
CTAGGTGTCAACAACCTGGAATTCAJ^TTGGCTCTTGCCTTATAATATCACCAAGGAGCAGGT 
ACCACTTGTGGTGGAGGAGTACCTGGACAATGTCAATGAGCATGACTGGAAGATGCTACGAA 
ACCGTATGATGGACATAGTTCAAGATGCCACTTTCGTGTATGCCACACTGCAGACTGCTCAC 
TACCACCGAGAAACCCCAATGATGGGAATCTGCCCTGCTGGCCACGCTACAACAAGGATGAA 
AAGTACCTGCAGCTGGATTTTACCACAAGAGTGGGCATGAAGCTCAAGGAGAAGAAGATGGC 
TTTTTGGATGAGTCTGTACCAGTCTCAAAGACCTGAGAAGCAGAGGCAATTCTAAGGGTGGC 
TATGCAGGAAGGAGCCAAAGAGGGGTTTGCCCCCACCATCCAGGCCCTGGGGAGACTAGCCA 
TGGACATACCTGGGGACAAGAGTTCTACCCACCCCAGTTTAGAACTGCAGGAGCTCCCTGCT 
GCCTCCAGGCCAAAGCTAGAGCTTTTGCCTGTTGTGTGGGACCTGCACTGCCCTTTCCAGCC 
TGACATCCCATGATGCCCCTCTACTTCACTGTTGACATCCAGTTAGGCCAGGCCCTGTCAAC 
ACCACACTGTGCTCAGCTCTCCAGCCTCAGGACAACCTCTTTTTTTCCCTTCTTCAAATCCT 
CCCACCCTTCAATGTCTCCTTGTGACTCCTTCTTATGGGAGGTCGACCCAGACTGCCACTGC 
CCCTGTCACTGCACCCAGCTTGGCATTTACCATCCATCCTGCTCAACCTTGTTCCTGTCTGT 
TCACATTGGCCTGGAGGCCTAGGGCAGGTTGTGACATGGAGCAAACTTTTGGTAGTTTGGGA 
TCTTCTCTCCCACCCACACTTATCTCCCCCAGGGCCACTCCAAAGTCTATACACAGGGGTGG 
TCTCTTCAATAAAGAAGTGTTGATTAGAAAAAAAAAAA 



FIGURE 91 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA44179 
<subunit 1 of 1, 545 aa, 1 stop 
<MW: 58934, pi: 9.45, NX(S/T): 4 

MSTGFSFGSGTLGSTTVAAGGTSTGGVFSFGTGTSSNPSVGLNFGNLGSTSTPATTSAPSSG 
FGTGLFGSKPATGFTLGGTNTGALHTKRPQWTKYGTLQGKQMHVGKTPIQVFLGVPFSRPP 
LGILRFAPPEPPEPWKGIRDATTYPPGWSIALSPGWSAVARSRLTATSASRVQASLLPQPLS 
VWGYRCLQESWGQLASMYVSTRERYKWLRFSEDCLYLNVYAPARAPGDPQLPVMVWFPGGAF 
IVGAASSYEGSDIAAREKVVLVFLQHRLGIFGFLSTDDSHARGNWGLLDQMAALRWVQENIA 
AFGGDPGIWTLFGQSAGAMSISGLMMSPLASGLFHRAISQSGTALFRLFITSNPLKVAKKVA 
HLAGCNHNSTQ I LVNCLRALSGTKVMRVSNKMRFIiQLNFQRDPEE 1 1 WSMS PWDGWI PDD 
PLVLLTQGKVSSVPYLLGVI^LEFNWLLPYNITKEQVPLVVEEYLDITVNEHDWKMLRNRMMD 
IVQDATFVYATLQTAHYHRETPMMGICPAGHATTRMKSTCSWILPQEWA 

Important features: 
Signal peptide: 

amino acids 1-29 

Carboxylest erases type-B serine active site. 

amino acids 312-327 

Carboxylesterases type-B signature 2 . 

amino acids 218-228 

N-glycosylation sites. 

amino acids 318-321, 380-383 and 465-468 



FIGURE 92 



GAGAACAGGCCTGTCTCAGGCAGGCCCTGCGCCTCCTATGCGGAGATGCTACTGCCACTGCT 
GCTGTCCTCGCTGCTGGGCGGGTCCCAGGCTATGGATGGGAGATTCTGGATACGAGTGCAGG 
AGTCAGTGATGGTGCCGGAGGGCCTGTGCATCTCTGTGCCCTGCTCTTTCTCCTACCCCCGA 
CAAGACTGGACAGGGTCTACCCCAGCTTATGGCTACTGGTTCAAAGCAGTGACTGAGACAAC 
CAAGGGTGCTCCTGTGGCCACAAACCACCAGAGTCGAGAGGTGGAAATGAGCACCCGGGGCC 
GATTCCAGCTCACTGGGGATCCCGCCAAGGGGAACTGCTCCTTGGTGATCAGAGACGCGCAG 
ATGCAGGATGAGTCACAGTACTTCTTTCGGGTGGAGAGAGGAAGCTATGTGACATATAATTT 
CATGAACGATGGGTTCTTTCTAAAAGTAACAGTGCTCAGCTTCACGCCCAGACCCCAGGACC 
ACAACACCGACCTCACCTGCCATGTGGACTTCTCCAGAAAGGGTGTGAGCGCACAGAGGACC 
GTCCGACTCCGTGTGGCCTATGCCCCCAGAGACCTTGTTATCAGCATTTCACGTGACAACAC 
GCCAGCCCTGGAGCCCCAGCCCCAGGGAAATGTCCCATACCTGGAAGCCCAAAAAGGCCAGT 
TCCTGCGGCTCCTCTGTGCTGCTGACAGCCAGCCCCCTGCCACACTGAGCTGGGTCCTGCAG 
AACAGAGTCCTCTCCTCGTCCCATCCCTGGGGCCCTAGACCCCTGGGGCTGGAGCTGCCCGG 
GGTGAAGGCTGGGGATTCAGGGCGCTACACCTGCCGAGCGGAGAACAGGCTTGGCTCCCAGC 
AGCGAGCCCTGGACCTCTCTGTGCAGTATCCTCCAGAGAACCTGAGAGTGATGGTTTCCCAA 
GCAAACAGGACAGTCCTGGAAAACCTTGGGAACGGCACGTCTCTCCCAGTACTGGAGGGCCA 
AAGCCTGTGCCTGGTCTGTGTCACACACAGCAGCCCCCCAGCCAGGCTGAGCTGGACCCAGA 
GGGGACAGGTTCTGAGCCCCTCCCAGCCCTCAGACCCCGGGGTCCTGGAGCTGCCTCGGGTT 
CAAGTGGAGCACGAAGGAGAGTTCACCTGCCACGCTCGGCACCCACTGGGCTCCCAGCACGT 
CTCTCTCAGCCTCTCCGTGCACTAT.AAGAAGGGACTCATCTCAACGGCATTCTCCAACGGAG 
CGTTTCTGGGAATCGGCATCACGGCTCTTCTTTTCCTCTGCCTGGCCCTGATCATCATGAAG 
ATTCTACCGAAGAGACGGACTCAGACAGAAACCCCGAGGCCCAGGTTCTCCCGGCACAGCAC 
GATCCTGGATTACATCAATGTGGTCCCGACGGCTGGCCCCCTGGCTCAGAAGCGGAATCAGA 
AAGCCACACCAAACAGTCCTCGGACCCCTCCTCCACCAGGTGCTCCCTCCCCAGAATCAAAG 
AAGAACCAGAAAAAGCAGTATCAGTTGCCCAGTTTCCCAGAACCCAAATCATCCACTCAAGC 
CCCAGAATCCCAGGAGAGCCAAGAGGAGCTCCATTATGCCACGCTCAACTTCCCAGGCGTCA 
GACCCAGGCCTGAGGCCCGGATGCCCAAGGGCACCCAGGCGGATTATGCAGAAGTCAAGTTC 
CAATGAGGGTCTCTTAGGCTTTAGGACTGGGACTTCGGCTAGGGAGGAAGGTAGAGTAAGAG 
GTTGAAGATAACAGAGTGCAAAGTTTCCTTCTCTCCCTCTCTCTCTCTCTTTCTCTCTCTCT 
CTCTCTTTCTCTCTCTTTTAAAAAAACATCTGGCCAGGGCACAGTGGCTCACGCCTGTAATC 
CCAGCACTTTGGGAGGTTGAGGTGGGCAGATCGCCTGAGGTCGGGAGTTCGAGACCAGCCTG 
GCCAACTTGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGCAGG 
CGCCTGTAATCCTACCTACTTGGGAAGCTGAGGCAGGAGAATCACTTGAACCTGGGAGACGG 
AGGTTGCAGTGAGCCAAGATCACACCATTGCACGCCAGCCTGGGCAACAAAGCGAGACTCCA 
TCTCAAAAAAAAAATCCTCCAAATGGGTTGGGTGTCTGTAATCCCAGCACTTTGGGAGGCTA 
AGGTGGGTGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAACCCC 
ATCTCTAGAAAAAATACAAAACATAGCTGGGCTTGGTGGTGTGTGCCTGTAGTCCCAGCTGT 
CAGACATTTAAACCAGAGCAACTCCATCTGGAATAGGAGCTGAATAAAATGAGGCTGAGACC 
TACTGGGCTGCATTCTCAGACAGTGGAGGCATTCTAAGTCACAGGATGAGACAGGAGGTCCG 
TACAAGATACAGGTCATAAAGACTTTGCTGATAAAACAGATTGCAGTAAAGAAGCCAACCAA 
ATCCCACCAAAACCAAGTTGGCCACGAGAGTGACCTCTGGTCGTCCTCACTGCTACACTCCT 
GACAGCACCATGACAGTTTACAAATGCCATGGCAACATCAGGAAGTTACCCGATATGTCCCA 
AAAGGGGGAGGAATGAATAATCCACCCCTTGTTTAGCAAATAAGCAAGAAATAACCATAAAA 
GTGGGCAACCAGCAGCTCTAGGCGCTGCTCTTGTCTATGGAGTAGCCATTCTTTTGTTCCTT 
TACTTTCTTAATAAACTTGCTTTCACCTTAAAAAAA 



FIGURE 93 



>< /us r / s eqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 4 0 0 2 
xsubunit 1 of 1, 544 aa, 1 stop 
><MW: 60268, pi: 9.53, NX(S/T): 3 

MLLPLLLSSLLGGSQAMDGRFWIRVQESVMVPEGLCISVPCSFSYPRQDWTGSTPAYGYWFK 
AVTETTKGAPVATNHQSREVEMSTRGRFQLTGDPAKGNCSLVIRDAQMQDESQYFFRVERGS 
YVTYNFMNDGFFLKVTVLS FTPRPQDHNTDLTCHVDFSRKGVS AQRTVRLRVAYAPRDLVI S 
ISRDNTPALEPQPQGNVPYLEAQKGQFLRLLCAADSQPPATLSWVLQNRVLSSSHPWGPRPL 
GLELPGVKAGDSGRYTCRAENRLGSQQRALDLSVQYPPENLRVMVSQANRTVLENLGNGTSL 
PVLEGQSLCLVCVTHSSPPARLSWTQRGQVLSPSQPSDPGVLELPRVQVEHEGEFTCHARHP 
LGSQHVSLSLSVHYKKGLISTAFSNGAFLGIGITALLFLCIiALIIMKILPKRRTQTETPRPR 
FSRHSTILDYINVVPTAGPLAQKRNQKATPNSPRTPPPPGAPSPESKKNQKKQYQLPSFPEP 
KSSTQAPESQESQEELHYATLNFPGWPRPEARMPKGTQADYAEVKFQ 

Important features: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 399-418 

N-glycosylation site. 

amino acids 100-103, 297-300 and 306-309 

Immunoglobulins and major histocompatibility complex proteins 
signature. 

amino acids 365-371 



FIGURE 94 

TGAAGAGTAATAGTTGGAATCAAAAGAGTCAACGCAATGAACTGTTATTTACTGCTGCGTTT 
TATGTTGGGAATTCCTCTCCTATGGCCTTGTCTTGGAGCAACAGAAAACTCTCAAACAAAGA 
AAGTCAAGCAGCCAGTGCGATCTCATTTGAGAGTGAAGCGTGGCTGGGTGTGGAACCAATTT 
TTTGTACCAGAGGAAATGAATACGACTAGTCATCACATCGGCCAGCTAAGATCTGATTTAGA 
CAATGGAAACAATTCTTTCCAGTACAAGCTTTTGGGAGCTGGAGCTGGAAGTACTTTTATCA 
TTGATGAAAGAACAGGTGACATATATGCCATACAGAAGCTTGATAGAGAGGAGCGATCCCTC 
TACATCTTAAGAGCCCAGGTAATAGACATCGCTACTGGAAGGGCTGTGGAACCTGAGTCTGA 
GTTTGTCATCAAAGTTTCGGATATCAATGACAATGAAC CAAAATTC CTAGATGAAC CTTATG 
AGGCCATTGTACCAGAGATGTCTCCAGAAGGAACATTAGTTATCCAGGTGACAGCAAGTGAT 
GCTGACGATCCCTCAAGTGGTAATAATGCTCGTCTCCTCTACAGCTTACTTCAAGGCCAGCC 
ATATTTTTCTGTTGAACCAACAACAGGAGTCATAAGAATATCTTCTAAAATGGATAGAGAAC 
TGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGTCAGCCAGGAGCGTTG 
TCTGGAACAACAAGTGTATTAATTAAACTTTCAGATGTTAATGACAATAAGCCTATATTTAA 
AGAAAGTTTATACCGCTTGACTGTCTCTGAATCTGCACCCACTGGGACTTCTATAGGAACAA 
TCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTACAGCATTGAAGAGGAT 
GATTCGCAAACATTTGACATTATTACTAATCATGAAACTCAAGAAGGAATAGTTATATTAAA 
AAAGAAAGTGGATTTTGAGCACCAGAACCACTACGGTATTAGAGCAAAAGTTAAAAACCATC 
ATGTTCCTGAGCAGCTCATGAAGTACCACACTGAGGCTTCCACCACTTTCATTAAGATCCAG 
GTGGAAGATGTTGATGAGCCTCCTCTTTTCCTCCTTCCATATTATGTATTTGAAGTTTTTGA 
AGAAACCCCACAGGGATCATTTGTAGGCGTGGTGTCTGCCACAGACCCAGACAATAGGAAAT 
CTCCTATCAGGTATTCTATTACTAGGAGCAAAGTGTTCAATATCAATGATAATGGTACAATC 
ACTACAAGTAACTCACTGGATCGTGAAATCAGTGCTTGGTACAACCTAAGTATTACAGCCAC 
AGAAAAATACAATATAGAACAGATCTCTTCGATCCCACTGTATGTGCAAGTTCTTAACATCA 
ATGATCATGCTCCTGAGTTCTCTCAATACTATGAGACTTATGTTTGTGAAAATGCAGGCTCT 
GGTCAGGTAATTCAGACTATCAGTGCAGTGGATAGAGATGAATCCATAGAAGAGCACCATTT 
TTACTTTAATCTATCTGTAGAAGACACTAACAATTCAAGTTTTACAATCATAGATAATCAAG 
ATAACACAGCTGTCATTTTGACTAATAGAACTGGTTTTAACCTTCAAGAAGAACCTGTCTTC 
TACATCTCCATCTTAATTGCCGACAATGGAATCCCGTCACTTACAAGTACAAACACCCTTAC 
CATCCATGTCTGTGACTGTGGTGACAGTGGGAGCACACAGACCTGCCAGTACCAGGAGCTTG 
TGCTTTCCATGGGATTCAAGACAGAAGTTATCATTGCTATTCTCATTTGCATTATGATCATA 
TTTGGGTTTATTTTTTTGACTTTGGGTTTAAAACAACGGAGAAAACAGATTCTATTTCCTGA 
GAAAAGTGAAGATTTCAGAGAGAATATATTCCAATATGATGATGAAGGGGGTGGAGAAGAAG 
ATACAGAGGCCTTTGATATAGCAGAGCTGAGGAGTAGTACCATAATGCGGGAACGCAAGACT 
CGGAAAACCACAAGCGCTGAGATCAGGAGCCTATACAGGCAGTCTTTGCAAGTTGGCCCCGA 
CAGTGCCATATTCAGGAAATTCATTCTGGAAAAGCTCGAAGAAGCTAATACTGATCCGTGTG 
CCCCTCCTTTTGATTCCCTCCAGACCTACGCTTTTGAGGGAACAGGGTCATTAGCTGGATCC 
CTGAGCTCCTTAGAATCAGCAGTCTCTGATCAGGATGAAAGCTATGATTACCTTAATGAGTT 
GGGACCTCGCTTTAAAAGATTAGCATGCATGTTTGGTTCTGCAGTGCAGTCAAATAATTAGG 
GCTTTTTACCATCAAAATTTTTAAAAGTGCTAATGTGTATTCGAACCCAATGGTAGTCTTAA 
AGAGTTTTGTGCCCTGGCTCTATGGCGGGGAAAGCCCTAGTCTATGGAGTTTTCTGATTTCC 
CTGGAGTAAATACTCCATGGTTATTTTAAGCTACCTACATGCTGTCATTGAACAGAGATGTG 
GGGAGAAATGTAAACAATCAGCTCACAGGCATCAATACAACCAGATTTGAAGTAAAATAATG 
TAGGAAGATATTAAAAGTAGATGAGAGGACACAAGATGTAGTCGATCCTTATGCGATTATAT 
CATTATTTACTTAGGAAAGAGTAAAAATACCAAACGAGAAAATTTAAAGGAGCAAAAATTTG 
CAAGTCAAATAGAAATGTACAAATCGAGATAACATTTACATTTCTATCATATTGACATGAAA 
ATTGAAAATGTATAGTCAGAGAAATTTTCATGAATTATTCCATGAAGTATTGTTTCCTTTAT 
TTAAA 



FIGURE 95 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53906 
xsubunit 1 of 1, 772 aa, 1 stop 
><MW: 87002, pi: 4.64, NX{S/T): 8 

MNCYLLLRFMLGIPLLWPCLGATENSQTKKVKQPVRSHLRVKRGWVWNQFFVPEEMNTTSHH 
IGQLRSDLDNGNNSFQYKLLGAGAGSTFIIDERTGDIYAIQKLDREERSLYILRAQVIDIAT 
GRAVEPESEFVIKVSD INDNEPKFLDEPYEAI VPEMS PEGTLVI QVTASDADDPSSGNNARL 
LYSLLQGQPYFSVEPTTGVIRI S SKMDRELQDEYWVI IQAKDMIGQPGALSGTTSVLIKLSD 
VNDNKP I FKESLYRLTVSE SAPTGTS IGT IMAYDND IGENAEMDYS I EEDDSQTFD 1 1 TNHE 
TQEGIVILKKKVDFEHQNHYGIRAKVXNHHVPEQLMKYHTEASTTFIKIQVEDTO 
PYYVFEVFEETPQGSFVGWSATDPDNRKSPIRYSITRSKVFNINDNGTITTSNSLDREISA 
WYNLSITATEKYNIEQISSIPLYVQVLNINDHAPEFSQYYETYVCENAGSGQVIQTISAVDR 
DES I EEHHFYFNLS VEDTNNS SFTI IDNQDNTAVILTNRTGFNLQEE PVFYI S ILIADNGI P 
SLTSTNTLTIHVCDCGDSGSTQTCQYQELVLSMGFKTEVI IAILICIMI IFGFIFLTLGLKQ 
RRKQI LFPEKS EDFREN I FQYDDEGGGEEDTEAFD I AELRS ST IMRERKTRKTTSAE IRS LY 
RQSLQVGPDSAIFRKFILEKLEEANTDPCAPPFDSLQTYAFEGTGSLAGSLSSLESAVSDQD 
ESYDYLNELGPRFKRLACMFGSAVQSNN 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 597-617 

N-glycosylation sites. 

amino acids 57-60, 74-77, 419-423, 437-440, 508-511, 515-518, 
516-519 and 534-537 

Cadherins extracellular repeated domain signature . 

amino acids 13 6-146 and 244-254 



ATTTCAAGGCCAGCCATATTTTTNTGTTGAACCAACAACAGGAGTCATAAGAATATTTTNTA 
AAATGGATAGAGAACTGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGT 
CAGCCAGGAGCGTTGTNTGGAACAACAAGTGTATTAATTAAACTTTCAGATGTTAATGACAA 
TAAGCCTATATTTAAAGAAAGTTTATACCGCTTGACTGTNTNTGAATCTGCACCCACTGGGA 
NTTNTATAGGAACAATCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTAC 
AGCATTGAAGAGGATGATTCGCAAA.CATTTGACATTATT 



FIGURE 97 



GCAACCTCAGCTTCTAGTATCCAGACTCCAGCGCCGCCCCGGGCGCGGACCCCAACCCCGAC 
CCAGAGCTTCTCCAGCGGCGGCGCAGCGAGCAGGGCTCCCCGCCTTAACTTCCTCCGCGGGG 
CCCAGCCACCTTCGGGAGTCCGGGTTGCCCACCTGCAAACTCTCCGCCTTCTGCACCTGCCA 
CCCCTGAGCCAGCGCGGGCCCCCGAGCGAGTCATGGCCAACGCGGGGCTGCAGCTGTTGGGC 
TTCATTCTCGCCTTCCTGGGATGGATCGGCGCCATCGTCAGCACTGCCCTGCCCCAGTGGAG 
GATTTACTCCTATGCCGGCGACAACATCGTGACCGCCCAGGCCATGTACGAGGGGCTGTGGA 
TGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAAT 
CTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGAT 
AGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGC 
AGAAGATGAGGATGGCTGTCATTGGGGGTGCGATATTTCTTCTTGCAGGTCTGGCTATTTTA 
GTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCCCAGT 
CAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCC 
TTCTGGGAGGTGCCCTACTTTGCTGTTCCTGTCCCCGAAAAACAACCTCTTACCCAACACCA 
AGGCCCTATCCAAAACCTGCACCTTCCAGCGGGAAAGACTACGT GTGA CACAGAGGCAAAAG 
GAGAAAATCATGTTGAAACAAACCGAAAATGGACATTGAGATACTATCATTAACATTAGGAC 
CTTAGAATTTTGGGTATTGTAATCTGAAGTATGGTATTACAAAACAAACAAACAAACAAAAA 
ACCCATGTGTTAAAATACTCAGTGCTAAACATGGCTTAATCTTATTTTATCTTCTTTCCTCA 
ATATAGGAGGGAAGATTTTTCCATTTGTATTACTGCTTCCCATTGAGTAATCATACTCAAAT 
GGGGGAAGGGGTGCTCCTTAAATATATATAGATATGTATATATACATGTTTTTCTATTAAAA 
ATAGACAGTAAAATACTATTCTCATTATGTTGATACTAGCATACTTAAAATATCTCTAAAAT 
AGGTAAATGTATTTAATTCCATATTGATGAAGATGTTTATTGGTATATTTTCTTTTTCGTCC 
TTATATACATATGTAACAGTCAAATATCATTTACTCTTCTTCATTAGCTTTGGGTGCCTTTG 
CCACAAGACCTAGCCTAATTTACCAAGGATGAATTCTTTCAATTCTTCATGCGTGCCCTTTT 
CATATACTTATTTTATTTTTTACCATAATCTTATAGCACTTGCATCGTTATTAAGCCCTTAT 
TTGTTTTGTGTTTCATTGGTCTCTATCTCCTGAATCTAACACATTTCATAGCCTACATTTTA 
GTTTCTAAAGCCAAGAAGAATTTATTACAAATCAGAACTTTGGAGGCAAATCTTTCTGCATG 
ACCAAAGTGATAAATTCCTGTTGACCTTCCCACACAATCCCTGTACTCTGACCCATAGCACT 
CTTGTTTGCTTTGAAAATATTTGTCCAATTGAGTAGCTGCATGCTGTTCCCCCAGGTGTTGT 
AACACAACTTTATTGATTGAATTTTTAAGCTACTTATTCATAGTTTTATATCCCCCTAAACT 
ACCTTTTTGTTCCCCATTCCTTAATTGTATTGTTTTCCCAAGTGTAATTATCATGCGTTTTA 
TATCTTCCTAATAAGGTGTGGTCTGTTTGTCTGAACAAAGTGCTAGACTTTCTGGAGTGATA 
ATCTGGTGACAAATATTCTCTCTGTAGCTGTAAGCAAGTCACTTAATCTTTCTACCTCTTTT 
TTCTATCTGCCAAATTGAGATAATGATACTTAACCAGTTAGAAGAGGTAGTGTGAATATTAA 
TTAGTTTATATTACTCTTATTCTTTGAACATGAACTATGCCTATGTAGTGTCTTTATTTGCT 
CAGCTGGCTGAGACACTGAAGAAGTCACTGAACAAAACCTACACACGTACCTTCATGTGATT 
CACTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACACACATACCTTCAT 
GTGGTTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACGCACATAC 
CTTCATGTGGCTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCATTCTTTCAGCTGTGTCT 
GACATGTTTGTGCTCTGTTCCATTTTAACAACTGCTCTTACTTTTCCAGTCTGTACAGAATG 
CTATTTCACTTGAGCAAGATGATGTAATGGAAAGGGTGTTGGCACTGGTGTCTGGAGACCTG 
GATTTGAGTCTTGGTGCTATCAATCACCGTCTGTGTTTGAGCAAGGCATTTGGCTGCTGTAA 
GCTTATTGCTTCATCTGTAAGCGGTGGTTTGTAATTCCTGATCTTCCCACCTCACAGTGATG 
TTGTGGGGATCCAGTGAGATAGAATACATGTAAGTGTGGTTTTGTAATTTAAAAAGTGCTAT 
ACTAAGGGAAAGAATTGAGGAATTAACTGCATACGTTTTGGTGTTGCTTTTCAAATGTTTGA 
AAATAAAAAAAATGTTAAG 



FIGURE 98 



>< /us r / s eqdb2 / s s t /DNA/Dnas eqs . min/ ss . DNA5 2185 
xsubunit 1 of 1, 211 aa, 1 stop 
><MW: 22744, pi: 8.51, NX(S/T): 1 

MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLVMSCVSQSTGQI 
QCKVFDSLLNLSSTLQATRALMWGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAVIGGA 
IFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGALLCCSC 
PRieTTSYPTPRPYPKPAPSSGKDYV 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domains: 

amino acids 82-102, 118-142 and 161-187 

N-glycosylation site. 

amino acids 72-75 

PMP-22 / EMP / MP20 family proteins 
amino acids 70-111 

ABC- 2 type transport system integral membrane protein 

amino acids 119-133 



FIGURE 99 



TTCTGGCCAAACCCGGGGCTNCAGCTGTTGGGCTTCATCTCGCCTTCCTGGGATGGATCGGC 
GCCATCNTCACACTGCCCTTCCCCAGTGGAGGATTTTACTCCCTATGCTGGCGACAACATCG 
TGACCGCCCAGCCCATGTACGAGGGCJCTGTGGATGTCCNGCGTGTCGCAGAGCACCGGGCAG 
ATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACATTGCAAGCAACCCGTGC 
CTTGATGGTGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGA 
AGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGC 
GCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTTGCCACAGCATGGTATGGCAATAGAAN 
CNTTCAACANTTCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCA 
GGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCT 
GTTCCTGTCCC 



FIGURE 1QQ 



ACCCTTGACCCAACGCGGCCCCCCGACCGNTTCATGGCCAAACGCGGGNCTCCAGCTGTTGG 
GCTTCATTCTCCCCTTCCTGGGATGGACCGGCGCCCATCNTCAGCACTGCCCTGCCCCAGTG 
GAGGATTTACTCCTATNCCGGCNACAACATCGTGACCGCCCAGGCCNTGTACGAGGGGCTGT 
GGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCCTTGCT 
GAATCTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAG 
TGATAGCAATCTTNNTGGCCACCGTTGTNNNTGAAGTGTATGAAGTGCTTGGAAGACGATGA 
GGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTA 
TTTTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCGA 



FIGURE 101 



GGGCCCGACCATTATCCAACCGGGNTCACTGTTGGCTCATCTCCCTCCTGGATGAANCGCGC 
CATCNTCAGACTCCCTGCCCCATGGAGATTTNNCCTATGCTGGCGACAACATCNTGACCCCC 
AGCCATGTACGAGGGGCTTTGAACGTCNGCGTGTCGCAGANCACCGGGCAGATCCAGTGCAA 
AGTCTTTGACTCCTTGCTGAATCTGNGCAGCACATTGCAGCAACCCNTGCCCTGATGGTGGT 
TGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGT 
GCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTT 
CTTGCAGGTCTGGCTATTTNNNGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAAT 
TCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGC 
TGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGA 



FIGURE 102 



ATTCTCCCCTCCTGGATGGATCGCNCCACCGTCACATTGCCTTCCCCCANTGGAGGATTNAC 
TCCTATGCTGGCGACAACATCGTGACCCCCCAGGCCATTTACCGAGGGGCTTTGGATGTCNT 
GCNTGTCGCAGAGCACCGGGCAGATCCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAG 
CAGCACATTGCAAGCAACCCGTGCCTTGATGGGGTTGGCATCCTCCTGGGAGTGATAGCAAC 
CTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCCAGAAG 
ATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTGTTGCAGGTCTGGCTATTTTAGTNGC 
CACAGCATGGTATGGCAATAGAl^NNTTCNNGNNNTCTATGACCCTATGACCCCAGTCAATG 
CCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTG 
GGAGGTGCCCTACTTTGCTGTTCCTGTCCC 



FIGURE 103 



AGAGCACCGGCAGATCCCAGTNCAAA.GTCTTTGACCCTTGCTGAATCTGAGCAGCACATTNC 
AAGCAACCCCTTGCCTTGAAGGTGGTTGNCATCCCCCCTGGGAGTGAATAGCAATCTTTGTG 
GCCACCGTTGGCATGAAGTNTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 
GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTNNCCACAGCAT 
GGTATGGCAATAGNATNNTTCGNGGNTTCTATGACCCTATGACCCCAGTCAATGCCAGGTAC 
GAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGC 
CCTACTTTGCTGTTCCTGTCCCCGAA 



FIGURE 104 



AGCAATGCCCTGCCCCCAGTGGAGGATTAATTCCTATGNTGGGGACAACATTGTGACNGCCC 
AGGCCATGTACGGGGGGCTGTGGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGC 
AAAGTNTTTGACTCCTTGCTGAATTTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGT 
GGTTGGCATCTTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTGGNAATGAAGTGTATGA 
AGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTT 
CTTNTTGCAGGTCTGGCTATTTTAGTTGCCACAGCATGGTATGGCAATAGAATNGTTCAAGA 
ATTTTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTTTNTTCACTG 
GCTGGGCTGCTGCTTNTTTCTGCCTTNTGGGAGGTGCCCTANTTTGCTGTTCCTGCGAACC 



FIGURE 105 



TCATAGGGGGGCGCGATATTTTTTCTTGCAGGTNTGGTTATTTTAGTTGCCACAGCATGGTA 
TGGCAATAGAATCGTTCAAGAATTKrTATGACCCTATGACCCCAGTCAATGCCAGGTACGAAT 
TTGGTCAGGCTCTNTTCACTGGNTGGGCTGCTGCTTCTNTNNGCCTTNTGGGAGGTGCCCTA 
CTTTGCTGTTCCTG 



TTCCTGGGATGGATCCGCCCCCATCNTCACATGCCCTGCCCCNTGGAGATTTACNCCTATGC 
TGGCGAACAACATCNTGACCGCCCAGGCCATGTACGAGGGGCTGTGGAATGTCCTGCGTGTC 
CCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACAT 
TGCAAGCAACCNTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGG 
CCACCGTTGGCATGAAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 
GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGNNGCCACAGCAT 
GGTATGGCAATCAGACCCNNTCANAAACTCTATGACCCTATGACCCCAGTCAATGCCAGGTA 
CGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTG 
CCCTACTTTGCTGTTCCTGTCCCCGAAAAACAACCTCTTACCCACG 



FIGURE 107 



CGGGGCTGCAGCTGTTGGGCTTCATCTCGCTTCCTGGGATGGAATCGGCGCCATCGTCAGCA 
CTGCCCTGCCCCATGGAGGATTTACTCNTATGCTGGCGACAACATCGTGACCNCCCAGGCCA 
TGTACGAGGGGCTGTGGATGTCNGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCT 
TTGACTCCTTGCTGAATCTGAGCAGCACATTGCAAGCAACCNTGCCTTGATGGTGGTTGGCA 
TCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTG 
GAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGC 
AGGTCTGGCTATTTNTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTAT 
GACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGC 
TGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGAA 



FIGURE 108 

GCGTGCCGTCAGCTCGCCGGGCACCGCGGCCTCGCCCTCGCCCTCCGCCCCTGCGCCTGCAC 
CGCGTAGACCGACCCCCCCCTCCAGCGCGCCCACCCGGTAGAGGACCCCCGCCCGTGCCCCG 
ACCGGTCCCCGCCTTTTTGTAAAACTTAAAGCGGGCGCAGCATTAACGCTTCCCGCCCCGGT 
GACCTCTCAGGGGTCTCCCCGCCAAAGGTGCTCCGCCGCTAAGGAACATGGCGAAGGTGGAG 
CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAGGTCCCTTCACCGATGTTGT 
CACCACCAACCTAAAGCTTGGCAACCCGACAGACCGAAATGTGTGTTTTAAGGTGAAGACTA 
CAGCACCACGTAGGTACTGTGTGAGGCCCAACAGCGGAATCATCGATGCAGGGGCCTCAATT 
AATGTATCTGTGATGTTACAGCCTTTCGATTATGATCCCAATGAGAAAAGTAAACACAAGTT 
TATGGTTCAGTCTATGTTTGCTCCAACTGACACTTCAGATATGGAAGCAGTATGGAAGGAGG 
CAAAACCGGAAGACCTTATGGATTCAAAACTTAGATGTGTGTTTGAATTGCCAGCAGAGAAT 
GATAAACCACATGATGTAGAAATAAATAAAATTATATCCACAACTGCATCAAAGACAGAAAC 
ACCAATAGTGTCTAAGTCTCTGAGTTCTTCTTTGGATGACACCGAAGTTAAGAAGGTTATGG 
AAGAATGTAAGAGGCTGCAAGGTGAAGTTCAGAGGCTACGGGAGGAGAACAAGCAGTTCAAG 
GAAGAAGATGGACTGCGGATGAGGAAGACAGTGCAGAGCAACAGCCCCATTTCAGCATTAGC 
CCCAACTGGGAAGGAAGAAGGCCTTAGCACCCGGCTCTTGGCTCTGGTGGTTTTGTTCTTTA 
TCGTTGGTGTAATTATTGGGAAGATTGCCTTGTAGAGGTAGCATGCACAGGATGGTAAATTG 
GATTGGTGGATCCACCATATCATGGGATTTAAATTTATCATAACCATGTGTAAAAAGAAATT 
AATGTATGATGACATCTCACAGGTCTTGCCTTTAAATTACCCCTCCCTGCACACACATACAC 
AGATACACACACACAAATATAATGTAACGATCTTTTAGAAAGTTAAAAATGTATAGTAACTG 
ATTGAGGGGGAAAAAGAATGATCTTTATTAATGACAAGGGAAACCATGAGTAATGCCACAAT 
GGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGCTGGATTACCTC 
TCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTGGAGCCCAGCAT 
GCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTG 
CTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGA 
AGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGT 
TGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAA 
GCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATA 
TTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGC 
TGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTGTGGGCTCCTCT 
GTCTCTGGAGAGTCTGGTCATGTGGAGGTGGGGTTTATTGGGATGCTGGAGAAGAGCTGCCA 
GGAAGTGTTTTTTCTGGGTCAGTAAATAACAACTGTCATAGGGAGGGAAATTCTCAGTAGTG 
ACAGTCAACTCTAGGTTACCTTTTTTAATGAAGAGTAGTCAGTCTTCTAGATTGTTCTTATA 
CCACCTCTCAACCATTACTCACACTTCCAGCGCCCAGGTCCAAGTCTGAGCCTGACCTCCCC 
TTGGGGACCTAGCCTGGAGTCAGGACAAATGGATCGGGCTGCAGAGGGTTAGAAGCGAGGGC 
ACCAGCAGTTGTGGGTGGGGAGCAAGGGAAGAGAGAAACTCTTCAGCGAATCCTTCTAGTAC 
TAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATAAAAGACCAACCCAGTTCTGTTTGA 
CTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCCCCAAAATTAAGAAAA 



FIGURE 109 

</usr/ seqdb2/ sst/DNA/Dnaseqs . min/ss . DNA53977 
<subunit 1 of 1, 243 aa, 1 stop 
<MW: 27228, pi: 7.43, NX(S/T): 2 

MMCVEQVLSLEPQHELKFRGPFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIID 
AGASI1WSVMLQPFDYDPNEKSKHKFMVQSMFAPTDTSDMEAWKEAKPEDLMDSKLRCVFE 
LPAENDKPHDVE INKI I STTASKTETP I VS KS £iS S S LDDTEVKKVMEECKRLQGEVQRLREE 
NKQFKEEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALVVLFFIVGVIIGKIAL 

Important features: 
Transmembrane domain: 

amino acids 224-239 

N-glycosylation site. 

amino acids 68-71 

N-myristoylation site. 

amino acids 59-64, 64-69 and 235-240 



FIGURE 110 



GTCAGTCTTCTAGATTGTCCTTATCCCACCTTTCAACCANTACTCACATTTCNAGCGCCCAG 
GTCCANGTCTGAGCCTGACTTCCCCTTGGGGACCTAGCCTGGAGTCAGGACAATGGNTCGGG 
CTGCAGAGGNTTAGAAGCGAGGGCACCAGCAGTTTTGGGTGGGGAGCAAGGGNNGAGAGAAA 
CTCTTCAGCGAATCCTTCTAGTACTAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATA 
AAAGACNAACCCAGTTCTGTTTGACTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCC 
CCAAAATTAAGAATTCTTTTGTCATTTTGTCACATTTGCTCTATGGGGGGAATTATTATTTT 
ATCATTTTTATTATTTTGCCATTGGAAGGTTAACTTTAAAATGAGC 



FIGURE 111 



TATTGTAAAGGCCATTTTAAACCATTGGTAGGCCTTGGTACATGATGCTGGATTACCTCCTT 
AAATGACACCNTTCCTCGCCTGTTGGTGCTGGCCNTTGGGGAGCTGGAGCCCCAGCATGCTG 
GGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTGCTTT 
CCGTGTCTTCAGTTCTGTCCAAGCC^TCAGCTCCTTGGGACTGATGAACAGAGTCAGAAGCC 
CAAAGGAATTGCCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGTTGA 
CTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAAGCT 
AAATTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATATTTA 
ACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGCTGCG 
TGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTG 



FIGURE 112 

CCCTGGTGGTTTTGTTCTTTAATTCGTTGGTGTAATTNTTGGGAAGATTGCTTGTAGAGGTA 
GNATGCACCNGGCTGGTAAATTGGATTGGTGGATCCACCATATCCATGGGATTTAAATTTAT 
CATAACCATGTGTAAAAAGAAATTAATGTATGATGACATNTCACAGGTATTGCCTTTAAATT 
ACCCATCCCTGNANACACATACACAGATACACANANACAAATNTAATGTAACGATNTTTTAG 
AAAGTTAAAAATGTATAGTAAC 



FIGURE 113 



GGTGGCCCATTCCCGGCCCAGGCTGCTTTCCGGTNTTCAGTTCTGTCCAAGCCATCAGCTCC 
TTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATNAGACGTAC 
TTGTNATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGT 
GCTTTGTTCANTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTG 
TTATTCAGAGATGTTTAATGCATATTTAANTTATTTAATGTATTTNATNTCATGTTTTCTTA 
TTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAANTNTGTTGGGTGAACTGGTATTGC 
TGCTGGAGGGCTGTGGGCTCCTCTGTCTTTGGAGAGTCTGGTCATGTGGAGGTGGG 



FIGURE 114 

TGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTTGATGAACAGAGTC 
AGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTG 
TGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGAC 
CAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGC 
ATATTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAA 
TGCTGCGTGC 



FIGURE 115 



AAACCTTTAAAAGTTGAGGGGAAAAGAATGATCCTTTATTAATGACAAGGGAAACCNTGNGT 
AATGCCACAATGGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGC 
TGGATTACCTCTCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTN 
GAGCCCAGCATGCTGGGGAGTGCGGTCTGCTCCACACAGTAGTCCCCANGTGGCCCANTCCC 
GGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGANTGATGA 
ACAGAGTCAGAAGCCCAAAGGAATTGCANTGTGGCAGCATCAGANGTANTNGTCATAAGTGA 
GAGGCGTGTGTTGANTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCANTT 
AAAGGGNCCAAGNTAAATTTGTATTGGTTCATGTAGTGAAGTCAAANTGTTATTCAGAGATG 
TTTAATGCATATTTAANTTATTTAA.TGTATTTCATNTCATGTTTTCTTATTGTCACAAGGGT 
ACAGTTAATGCTGCGTGCTGCTGAAWTCTGTTGGGTGAANTGGTATTGCTG 



FIGURE 116 



GGCCCTTGGGGAGCTGGAGCCCAGCATGCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCC 
CACGTGGCCCACTCCCGGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGC 
TCCTTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACG 
TACTCGTCATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGC 
AGTGCTTTGTTCACTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAA 
CTGTTATTCAGAGATGTTTAATGCA.TATTTAACTTATTTAATGTATTTCATCTCATGTTTTC 
TTATTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTAT 
TGCTGCTGGAGGGCTGTGGGCTCCTCTGTCTCTGGAGAGTCTGGTCATGTGGAGGTGGG 



FIGURE 117 



GCGAGCTCCGGGTGCTGTGGCCCGGCCTTGGCGGGGCGGCCTCCGGCTCAGGCTGGCTGAGA 
GGCTCCCAGCTGCAGCGTCCCCGCCCGCCTCCTCGGGAGCTCTGATCTCAGCTGACAGTGCC 
CTCGGGGACCAAACAAGCCTGGCAGGGTCTCACTTTGTTGCCCAGGCTGGAGTTCAGTGCCA 
TGATCATGGTTTACTGCAGCCTTGACCTCCTGGGTTCAAGCGATCCTGCTGAGTAGCTGGGA 
CTACAGGACAAAATTAGAAGATCAAAATGGAAAATATGCTGCTTTGGTTGATATTTTTCACC 
CCTGGGTGGACCCTCATTGATGGATCTGAAATGGAATGGGATTTTATGTGGCACTTGAGAAA 
GGTACCCCGGATTGTCAGTGAAAGGACTTTCCATCTCACCAGCCCCGCATTTGAGGCAGATG 
CTAAGATGATGGTAAATACAGTGTGTGGCATCGAATGCCAGAAAGAACTCCCAACTCCCAGC 
CTTTCTGAATTGGAGGATTATCTTTCCTATGAGACTGTCTTTGAGAATGGCACCCGAACCTT 
AACCAGGGTGAAAGTTCAAGATTTGGTTCTTGAGCCGACTCAAAATATCACCACAAAGGGAG 
TATCTGTTAGGAGAAAGAGACAGGTGTATGGCACCGACAGCAGGTTCAGCATCTTGGACAAA 
AGGTTCTTAACCAATTTCCCTTTCAGCACAGCTGTGAAGCTTTCCACGGGCTGTAGTGGCAT 
TCTCATTTCCCCTCAGCATGTTCTAACTGCTGCCCACTGTGTTCATGATGGAAAGGACTATG 
TCAAAGGGAGTAAAAAGCTAAGGGTAGGGTTGTTGAAGATGAGGAATAAAAGTGGAGGCAAG 
AAACGTCGAGGTTCTAAGAGGAGCAGGAGAGAAGCTAGTGGTGGTGACCAAAGAGAGGGTAC 
CAGAGAGCATCTGCAGGAGAGAGCGAAGGGTGGGAGAAGAAGAAAAAAATCTGGCCGGGGTC 
AGAGGATTGCCGAAGGGAGGCCTTCCTTTCAGTGGACCCGGGTCAAGAATACCCACATTCCG 
AAGGGCTGGGCACGAGGAGGCATGGGGGACGCTACCTTGGACTATGACTATGCTCTTCTGGA 
GCTGAAGCGTGCTCACAAAAAGAAATACATGGAACTTGGAATCAGCCCAACGATCAAGAAAA 
TGCCTGGTGGAATGATCCACTTCTCAGGATTTGATAACGATAGGGCTGATCAGTTGGTCTAT 
CGGTTTTGCAGTGTGTCCGACGAATCCAATGATCTCCTTTACCAATACTGCGATGCTGAGTC 
GGGCTCCACCGGTTCGGGGGTCTATCTGCGTCTGAAAGATCCAGACAAAAAGAATTGGAAGC 
GCAAAATCATTGCGGTCTACTCAGGGCACCAGTGGGTGGATGTCCACGGGGTTCAGAAGGAC 
TACAACGTTGCTGTTCGCATCACTCCCCTAAAATACGCCCAGATTTGCCTCTGGATTCACGG 
GAACGATGCCAATTGTGCTTACGGCTAACAGAGACCTGAAACAGGGCGGTGTATCATCTAAA 
TCACAGAGAAAACCAGCTCTGCTTACCGTAGTGAGATCACTTCATAGGTTATGCCTGGACTT 
GAACTCTGTCAATAGCATTTCAACATTTTTCAAAATCAGGAGATTTTCGTCCATTTAAAAAA 
TGTATAGGTGCAGATATTGAAACTAGGTGGGCACTTCAATGCCAAGTATATACTCTTCTTTA 
CATGGTGATGAGTTTCATTTGTAGA?^AAATTTTGTTGCCTTCTTAAAAATTAGACACACTTT 
AAACCTTCAAACAGGTATTATAAATAACATGTGACTCCTTAATGGACTTATTCTCAGGGTCC 
TACTCTAAGAAGAATCTAATAGGATGCTGGTTGTGTATTAAATGTGAAATTGCATAGATAAA 
GGTAGATGGTAAAGCAATTAGTATCAGAATAGAGACAGAAAGTTACAACACAGTTTGTACTA 
CTCTGAGATGGATCCATTCAGCTCATGCCCTCAATGTTTATATTGTGTTATCTGTTGGGTCT 
GGGACATTTAGTTTAGTTTTTTTGAAGAATTACAAATCAGAAGAAAAAGCAAGCATTATAAA 
CAAAACTAATAACTGTTTTACTGCTTTAAGAAATAACAATTACAATGTGTATTATTTAAAAA 
TGGGAGAAATAGTTTGTTCTATGAAATAAACCTAGTTTAGAAATAGGGAAGCTGAGACATTT 
TAAGATCTCAAGTTTTTATTTAACTAATACTCAAAATATGGACTTTTCATGTATGCATAGGG 
AAGACACTTCACAAATTATGAATGATCATGTGTTGAAAGCCACATTATTTTATGCTATACAT 
TCTATGTATGAGGTGCTACATTTTTAGGACAAAGAATTCTGTAATCTTTTTCAAGAAAGAGT 
CTTTTTCTCCTTGACAAAATCCAGCTTTTGTATGAGGACTATAGGGTGAATTCTCTGATTAG 
TAATTTTAGATATGTCCTTTCCTAAAAATGAATAAAATTTATGAATATGA 



FIGURE 118 



</usr/segdb2/sst/DNA/Dnaseqs.min/ss.DNA57253 
<subunit 1 of 1, 413 aa, l stop 
<MW: 47070, pi: 9.92, NX(S/T): 3 

MEl^LLWLIFFTPGWTLIDGSEMEWDFMWHLRKVPRIVSERTFHLTSPAFEADAKMIWNTVC 
GIECQKELPTPSLSELEDYLSYETVFENGTRTLTRVKVQDLVLEPTQNITTKGVSVRRKRQV 
YGTDSRFSILDKRFLTNFPFSTAVKLSTGCSGILISPQHVLTAAHCVHDGKDYVKGSKKLRV 
GLLKMRNKSGGKKRRGSKRSRREASGGDQREGTREHLQERAKGGRRRKKSGRGQRIAEGRPS 
FQWTRVKNTHIPKGWARGGMGDATLDYDYALLELKRAHKKKYMELGISPTIKKMPGGMIHFS 
GFDJSTORADQLVYRFCSVSDESNDLLYQYCDAESGSTGSGWLRLKDPDKKNWKRKIIAVYSG 
HQWVDVHGVQKDYNVAVRITPLKYAQICLWIHGNDANCAYG 

Important features: 
Signal peptide: 
amino acids 1-16 

N-glycosylation sites. 

amino acids 90-93, 110-113 and 193-196 

Glycosaminoglycan attachment site. 

amino acids 236-239 

Serine proteases, trypsin family, histidine active site. 

amino acids 165-170 



FIGURE 119 



AATGTGAGAGGGGCTGATGGAAGCTGATAGGCAGGACTGGAGTGTTAGCACCAGTACTGGAT 
GTGACAGCAGGCAGAGGAGCACTTAGCAGCTTATTCAGTGTCCGATTCTGATTCCGGCAAGG 
ATCCAAGCATGGAATGCTGCCGTCGGGCAACTCCTGGCACACTGCTCCTCTTTCTGGCTTTC 
CTGCTCCTGAGTTCCAGGACCGCACGCTCCGAGGAGGACCGGGACGGCCTATGGGATGCCTG 
GGGCCCATGGAGTGAATGCTCACGCACCTGCGGGGGAGGGGCCTCCTACTCTCTGAGGCGCT 
GCCTGAGCAGCAAGAGCTGTGAAGGAAGAAATATCCGATACAGAACATGCAGTAATGTGGAC 
TGCCCACCAGAAGCAGGTGATTTCCGAGCTCAGCAATGCTCAGCTCATAATGATGTCAAGCA 
CCATGGCCAGTTTTATGAATGGCTTCCTGTGTCTAATGACCCTGACAACCCATGTTCACTCA 
AGTGCCAAGCCAAAGGAACAACCCTGGTTGTTGAACTAGCACCTAAGGTCTTAGATGGTACG 
CGTTGCTATACAGAATCTTTGGATATGTGCATCAGTGGTTTATGCCAAATTGTTGGCTGCGA 
TCACCAGCTGGGAAGCACCGTCAAGGAAGATAACTGTGGGGTCTGCAACGGAGATGGGTCCA 
CCTGCCGGCTGGTCCGAGGGCAGTATAAATCCCAGCTCTCCGCAACCAAATCGGATGATACT 
GTGGTTGCACTTCCCTATGGAAGTAGACATATTCGCCTTGTCTTAAAAGGTCCTGATCACTT 
ATATCTGGAAACCAAAACCCTCCAGC3GGACTAAAGGTGAAAACAGTCTCAGCTCCACAGGAA 
CTTTCCTTGTGGACAATTCTAGTGTGGACTTCCAGAAATTTCCAGACAAAGAGATACTGAGA 
ATGGCTGGACCACTCACAGCAGATTTCATTGTCAAGATTCGTAACTCGGGCTCCGCTGACAG 
TACAGTCCAGTTCATCTTCTATCAACCCATCATCCACCGATGGAGGGAGACGGATTTCTTTC 
CTTGCTCAGCAACCTGTGGAGGAGGTTATCAGCTGACATCGGCTGAGTGCTACGATCTGAGG 
AGGAACCGTGTGGTTGCTGACCAATACTGTGACTATTACCCAGAGAACATCAAACCCAAACC 
CAAGCTTCAGGAGTGCAACTTGGATCCTTGTCCAGCCAGTGACGGATACAAGCAGATCATGC 
CTTATGACCTCTACCATCCCCTTCCTCGGTGGGAGGCCACCCCATGGACCGCGTGCTCCTCC 
TCGTGTGGGGGGGGCATCCAGAGCCGGGCAGTTTCCTGTGTGGAGGAGGACATCCAGGGGCA 
TGTCACTTCAGTGGAAGAGTGGAAATGCATGTACACCCCTAAGATGCCCATCGCGCAGCCCT 
GCAACATTTTTGACTGCCCTAAATGGCTGGCACAGGAGTGGTCTCCGTGCACAGTGACATGT 
GGCCAGGGCCTCAGATACCGTGTGGTCCTCTGCATCGACCATCGAGGAATGCACACAGGAGG 
CTGTAGCCCAAAAACAAAGCCCCACATAAAAGAGGAATGCATCGTACCCACTCCCTGCTATA 
AACCCAAAGAGAAACTTCCAGTCGAGGCCAAGTTGCCATGGTTCAAACAAGCTCAAGAGCTA 
GAAGAAGGAGCTGCTGTGTCAGAGGAGCCCTCGTAAGTTGTAAAAGCACAGACTGTTCTATA 
TTTGAAACTGTTTTGTTTAAAGAAAGCAGTGTCTCACTGGTTGTAGCTTTCATGGGTTCTGA 
ACTAAGTGTAATCATCTCACCAAAGCTTTTTGGCTCTCAAATTAAAGATTGATTAGTTTCAA 
AAAAAAAAA 



FIGURE 120 



</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA58847 
<subunit 1 of 1, 525 aa, 1 stop 
<MW: 58416, pi: 6.62, NX(S/T): 1 

MECCRRATPGTLLLFLAFLLLSSRTARSEEDRDGLWDAWGPWSECSRTCGGGASYSLRRCIiS 
SKSCEGRNIRYRTCSlSrVDCPPEAGDFRAQQCSAHNDVKHHGQFYEWLPVSNDPDNPCSLKCQ 
AKGTTLWELAPKYLDGTRCYTESLDMCISGLCQIVGCDHQLGSTVKEDNCGVCNGDGSTCR 
LVRGQYKSQLSATKSDDTWALPYGSRHIRLVLKGPDHLYLETKTLQGTKGENSLSSTGTFL 
VDNSSVDFQKFPDKEILRMAGPLTADFIVKIRNSGSADSTVQFIFYQPIIHRWRETDFFPCS 
ATCGGGYQLTSAECYDLRSNRWADQYCHYYPENIKPKPKLQECNLDPCPASDGYKQIMPYD 
LYHPLPRWEATPWTACSSSCGGGI QSRAVS CVEED IQGHVTS VEEWKCMYTPKMP I AQPCNI 
FDCPKWLAQEWSPCTVTCGQGLRYRWLCIDHRGMHTGGCSPKTKPHIKEECIVPTPCYKPK 
EKLPVEAKLPWFKQAQELEEGAAVSEEPS 

Important features: 
Signal peptide: 

amino acids 1-25 

N-glycosylation site. 

amino acids 251-254 

Thrombospondin 1 

amino acids 385-399 

von Willebrand factor type C domain proteins 

amino acids 385-399, 445-459 and 42-56 



FIGURE 121 



CGGACGCGTGGGCGGCGGCTGCGGAACTCCCGTGGAGGGGCCGGTGGGCCCTCGGGCCTGAC 
AGATGGCAGTGGCCACTGCGGCGGCAGTACTGGCCGCTCTGGGCGGGGCGCTGTGGCTGGCG 
GCCCGCCGGTTCGTGGGGCCCAGGGTCCAGCGGCTGCGCAGAGGCGGGGACCCCGGCCTCAT 
GCACGGGAAGACTGTGCTGATCACCGGGGCGAACAGCGGCCTGGGCCGCGCCACGGCCGCCG 
AGCTACTGCGCCTGGGAGCGCGGGTGATCATGGGCTGCCGGGACCGCGCGCGCGCCGAGGAG 
GCGGCGGGTCAGCTCCGCCGCGAGCTCCGCCAGGCCGCGGAGTGCGGCCCAGAGCCTGGCGT 
CAGCGGGGTGGGCGAGCTCATAGTCCGGGAGCTGGACCTCGCCTCGCTGCGCTCGGTGCGCG 
CCTTCTGCCAGGAAATGCTCCAGGAAGAGCCTAGGCTGGATGTCTTGATCAATAACGCAGGG 
ATCTTCCAGTGCCCTTACATGAAGACTGAAGATGGGTTTGAGATGCAGTTCGGAGTGAACCA 
TCTGGGGCACTTTCTACTCACCAATCTTCTCCTTGGACTCCTCAAAAGTTCAGCTCCCAGCA 
GGATTGTGGTAGTTTCTTCCAAACTTTATAAATACGGAGACATCAATTTTGATGACTTGAAC 
AGTGAACAAAGCTATAATAAAAGCTTTTGTTATAGCCGGAGCAAACTGGCTAACATTCTTTT 
TACCAGGGAACTAGCCCGCCGCTTAGAAGGCACAAATGTCACCGTCAATGTGTTGCATCCTG 
GTATTGTACGGACAAATCTGGGGAGGCACATACACATTCCACTGTTGGTCAAACCACTCTTC 
AATTTGGTGTCATGGGCTTTTTTCAAAACTCCAGTAGAAGGTGCCCAGACTTCCATTTATTT 
GGCCTCTTCACCTGAGGTAGAAGGAGTGTCAGGAAGATACTTTGGGGATTGTAAAGAGGAAG 
AACTGTTGCCCAAAGCTATGGATGAATCTGTTGCAAGAAAACTCTGGGATATCAGTGAAGTG 
ATGGTTGGCCTGCTAAAATAGGAAC.AAGGAGTAAAAGAGCTGTTTATAAAACTGCATATCAG 
TTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACTTGAAGAAAAAGAATTTTG 
ATATTGGAATAGCCTGCTAAGAGGTACATGTGGGTATTTTGGAGTTACTGAAAAATTATTTT 
TGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGTAAGTATAATGAATAATAA 
GTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCAAGCATGGATGACATATTA 
ATATTTGTCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTTTTCAAGTATCTTTGAGTT 
TCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTTTGTGTGGAAATTATCTGC 
CTGGTGTGTGCACACAAGTCTTACTTGGAATAAATTTACTGGTAC 



FIGURE 122 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA58747 
<subunit 1 of 1, 336 aa, 1 stop 
<MW: 36865, pi: 9.15, NX(S/T): 2 

MAVATAAAVLAALGGALWLAARRFVGPRVQRLRRGGDPGLmGKTVLITGANSGLGRATAAE 
LLRLGARVIMGCRDRARAEEAAGQLRRELRQAAECGPEPGVSGVGELIVRELDLASLRSVRA 
FCQEMLQEEPRLDVLINNAGIFQCPTfMKTEDGFEMQFGVNHLGHFLLTNLLLGLLKSSAPSR 
IVVVSSKLYKYGDINFDDLNSEQSYNKSFCT 

IVRTNLGRHIHI PLLVKPLFNLVSWAFFKTPVEGAQTS I YLASS PEVEGVSGRYFGDCKEEE 
LLPKAMDESVARKLWDI SEVMVGLLK 

Important features: 
Signal peptide: 

amino acids 1-21 

Short-chain alcohol dehydrogenase family protein 

amino acids 134-144, 44-56 and 239-248 

N-glycosylation site. 

amino acids 212-215 and 239-242 



FIGURE 123 



GGGGATTGTAAAGAGGAAGNACTGTGCCCAAAGNTATGGATGAATCTGTTGCAAGAAAATTN 
TGGGATATCAGTGAAGTGATGGTTNGCCTGCTAAAATAGGAACAAGGAGTAAAAGAGCTGTT 
TATAAAACTGCATATCAGTTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACT 
TGAAGAAAAAGAATTTTGATATTGG.AATAGCCTGNTAAGAGGNACATGTGGGTATTTTGGAG 
TTACTGAAAAATTATTTTTGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGT 
AAGTATAATGAATAATAAGTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCA 
AGCATGGATGACATATTAATATTTGTCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTT 
TTCAAGTATCTTTGAGTTTCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTT 
TGTGTGGAAATTATCTGCCTGGCTT 



FIGURE 124 



GAGAGGACGAGGTGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCC 
CTTTCCTAACCCAACCCAACCTAGCCCAGTCCCAGCCGCCAGCGCCTGTCCCTGTCACGGAC 
CCCAGCGTTACC ATG CATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCT 
GCTCCTGGTAACTTGGGTTTTTACTCCTGTAACAACTGAAATAACAAGTCTTGCTACAGAGA 
ATATAGATGAAATTTTAAACAATGCTGATGTTGCTTTAGTAAATTTTTATGCTGACTGGTGT 
CGTTTCAGTCAGATGTTGCATCCAATTTTTGAGGAAGCTTCCGATGTCATTAAGGAAGAATT 
TCCAAATGAAAATCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCC 
AGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAATGGGATGATGATGAAG 
AGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTACATCAGGCAACAAAAAAG 
TGACCC<^TTCAAGAAATTCGGGACTTAGCAGAAATCACCACTCTTGATCGCAGCAAAAGAA 
ATATCATTGGATATTTTGAGCAAAAGGACTCGGACAACTATAGAGTTTTTGAACGAGTAGCG 
AATATTTTGCATGATGACTGTGCCTTTCTTTCTGCATTTGGGGATGTTTCAAAACCGGAAAG 
ATATAGTGGCGACAACATAATCTACAAACCACCAGGGCATTCTGCTCCGGATATGGTGTACT 
TGGGAGCTATGACAAATTTTGATGTGACTTACAATTGGATTCAAGATAAATGTGTTCCTCTT 
GTCCGAGAAATAACATTTGAAAATGGAGAGGAATTGACAGAAGAAGGACTGCCTTTTCTCAT 
ACTCTTTCACATGAAAGAAGATACAGAAAGTTTAGAAATATTCCAGAATGAAGTAGCTCGGC 
AATTAATAAGTGAAAAAGGTACAATAAACTTTTTACATGCCGATTGTGACAAATTTAGACAT 
CCTCTTCTGCACATACAGAAAACTCCAGCAGATTGTCCTGTAATCGCTATTGACAGCTTTAG 
GCATATGTATGTGTTTGGAGACTTCJ^AAGATGTATTAATTCCTGGAAAACTCAAGCAATTCG 
TATTTGACTTACATTCTGGAAAACTGCACAGAGAATTCCATCATGGACCTGACCCAACTGAT 
ACAGCCCCAGGAGAGCAAGCCCAAGATGTAGCAAGCAGTCCACCTGAGAGCTCCTTCCAGAA 
ACTAGCACCCAGTGAATATAGGTATACTCTATTGAGGGATCGAGATGAGCTTTAAAAACTTG 
AAAAACAGTTTGTAAGCCTTTCAACAGCAGCATCAACCTACGTGGTGGAAATAGTAAACCTA 
TATTTTCATAATTCTATGTGTATTTTTATTTTGAATAAACAGAAAGAAATTTAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 125 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA57689 
<subunit 1 of 1, 406 aa, 1 stop 
<MW: 46927, pi: 5.21, NX(S/T): 0 

MHPAVFLS LPDLRCSLLLLVTWVFTPVTTE ITS LATENIDE I LMNADVALVNFYADWCRFSQ 
MLHPI FEEASDVI KEEFPNENQWFARVDCDQHSDIAQRYRI SKYPTLKLFRNGMMMKREYR 
GQRSVKALADYIRQQKSDPIQEIRDLAEITTLDRSKRNIIGYFEQKDSDimiVFERVANILH 
DDCAFLSAFGDVS KPERYSGDNI I YKPPGHS APDMVYLGAMTNFDVTYNWIQDKCVPLVRE I 
TFENGEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINFLHADCDKFRHPLLH 
IQKTPADCPVIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDLHSGKLHREFHHGPDPTDTAPG 
EQAQDVAS S PPES S FQKLAPSEYRYTLLRDRDEL 

Important features: 
Signal peptide: 

amino acids 1-29 

Endoplasmic reticulum targeting sequence. 

amino acids 403-406 

Tyrosine kinase phosphorylation site. 

amino acids 203-211 

Thioredoxin family proteins 

amino acids 50-66 



FIGURE 12 



ATTAAGGAAGAATTTCCAAATGAAAATCAAGTAGTNTTTGCCAGAGTNGATTGTGATCAGCA 
CTCTGACATAGCCCAGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAATG 
GGATGATGATGAAGAGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTA 



FIGURE 127 



AGAGGCCTCTCTGGAAGTTGTCCCGGGTGTTCGCCGCNGGAGCCCGGGTCGAGAGGACNAGG 
TGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCCCTTTCCTAACCC 
AACCCAACCTAGCCCNGTCCCAGCCGCCAGCGCCTGTCCCTGTCNCGGANCCCAGCGTNACC 
ATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCTGCTCCTGGTAAC 
TTGGGTTTTTACTCCTGTAACAACTGAAATAACNNGTCTTGATACNNAGAATATAGATGAAA 
TTTTAAACNATGCTGATGTGGCTTTAGTCAATTTTTATGCTGACTGGTGTCGTTTCAGTCAG 
ATGTGGCATCCAATTTTTGAGGANGCTTCCGATGTCATTAAGGAAGAATTTCCAAATGAAAA 
TCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCCAGAGATACAGGA 
TAAGCAAATACCCAACCCTCAAATTGTTTCGTAATGGGATGATGATGAAGAGAGAATACAGG 
GGTCAGCGATCAGTGAAAGCATTGGCAGATTACATCAGGC 



FIGURE 128 



GCCCACGCGTCCGATGGCGTTCACGTTCGCGGCCTTCTGCTACATGCTGGCGCTGCTGCTCA 
CTGCCGCGCTCATCTTCTTCGCCATTTGGCACATTATAGCATTTGATGAGCTGAAGACTGAT 
TACAAGAATCCTATAGACCAGTGTAATACCCTGAATCCCCTTGTACTCCCAGAGTACCTCAT 
CCACGCTTTCTTCTGTGTCATGTTTCTTTGTGCAGCAGAGTGGCTTACACTGGGTCTCAATA 
TGCCCCTCTTGGCATATCATATTTGGAGGTATATGAGTAGACCAGTGATGAGTGGCCCAGGA 
CTCTATGACCCTACAACCATCATGAATGCAGATATTCTAGCATATTGTCAGAAGGAAGGATG 
GTGCAAATTAGCTTTTTATCTTCTAGCATTTTTTTACTACCTATATGGCATGATCTATGTTT 
TGGTGAGCTCTTAGAACAACACACAGAAGAATTGGTCCAGTTAAGTGCATGCAAAAAGCCAC 
CAAATGAAGGGATTCTATCCAGCAAGATCCTGTCCAAGAGTAGCCTGTGGAATCTGATCAGT 
TACTTTAAAAAATGACTCCTTATTTTTTAAATGTTTCCACATTTTTGCTTGTGGAAAGACTG 
TTTTCATATGTTATACTCAGATAAAGATTTTAAATGGTATTACGTATAAATTAATATAAAAT 
GATTACCTCTGGTGTTGACAGGTTTGAACTTGCACTTCTTAAGGAACAGCCATAATCCTCTG 
AATGATGCATTAATTACTGACTGTCCTAGTACATTGGAAGCTTTTGTTTATAGGAACTTGTA 
GGGCTCATTTTGGTTTCATTGAAACAGTATCTAATTATAAATTAGCTGTAGATATCAGGTGC 
TTCTGATGAAGTGAAAATGTATATCTGACTAGTGGGAAACTTCATGGGTTTCCTCATCTGTC 
ATGTCGATGATTATATATGGATACATTTACAAAAATAAAAAGCGGGAATTTTCCCTTCGCTT 
GAATATTATCCCTGTATATTGCATGAATGAGAGATTTCCCATATTTCCATCAGAGTAATAAA 
TATACTTGCTTTAATTCTTAAGCATAAGTAAACATGATATAAAAATATATGCTGAATTACTT 
GTGAAGAATGCATTTAAAGCTATTTTAAATGTGTTTTTATTTGTAAGACATTACTTATTAAG 
AAATTGGTTATTATGCTTACTGTTCTAATCTGGTGGTAAAGGTATTCTTAAGAATTTGCAGG 
TACTACAGATTTTCAAAACTGAATGAGAGAAAATTGTATAACCATCCTGCTGTTCCTTTAGT 
GCAATACAATAAAACTCTGAAATTAAGACTC 



FIGURE 129 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA23330 
<subunit 1 of 1, 144 aa, 1 stop 
<MW: 16699, pi: 5.60, NX(S/T): 0 

MAFTFAAFCYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCOTLNPLVLPEYLIHAFF 
CVMFLCAAEWLTLGLmPLIAYHIWTlYMSRPVMSGPGLYDPTTIMNADIIiAYCQKEGWCKIiA 
FYLLAFFYYLYGMIYVLVSS 

Important features: 
Signal peptide: 

amino acids 1-20 

Type II transmembrane domain: 

amino acids 11-31 



Other transmembrane domain: 

amino acids 57-77 and 123-143 



FIGURE 130 

ATTATAGCATTTGATGAGCTGAAGACTGATTACAAGATCCTATAGACCAGTGTAATACCCTG 
AATCCCCTTGTACTCCCAGAGTACCTCATCCACGCTTTCTTCTGTGTCATGTTTCTTTGTGC 
AGCAGAGTGGCTTACACTGGGTCTCAATATGCCCCTCTTGGCATATCATATTTGGAGGTATA 
TGAGTAGACCAGTGATGAGTGGCCCAGGACTCTATGACCCTACAACCATCATGAATGCAGAT 
ATTCTAGCATATTGTCAGAAGGAAGGATGGTGCAAATTAGCTTTTTATCTTCTAGCATTTTT 
TTACTACCTATATGGCATGATCTATGTTTTGGTGAGCTCTTAGAACAACACACAGAAGAATT 
GGTCCAGTTAAGTGCATGCAAAAAGCCACCAAATGAAGGGATTCTATCCAGCAAGATCCTGT 
CCAAGAGTAGCCTGTGGAATCTGATCAGTTACTTTAAAAAATG 



FIGURE 131 



CGGACGCGTGGGGGAAACCCTTCCGAGAAAACAGCAACAAGCTGAGCTGCTGTGACAGAGGG 
GAACAAGATGGCGGCGCCGAAGGGGAGCCTCTGGGTGAGGACCCAACTGGGGCTCCCGCCGC 
TGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGCTGAAGCATTTGAC 
TCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACAC 
CTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTC 
AGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACA 
GAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCC 
ATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTC 
CTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGCACAGAGCTTCATAACC 
TCTTCATGGACTTTTTATCTTCAAGCCGATGACGGAAAAATAGTTATATTCCAGTCTAAGCC 
AGAAATCCAGTACGCACCACATTTGGAGCAGGAGCCTACAAATTTGAGAGAATCATCTCTAA 
GCAAAATGTCCTATCTGCAAATGAGAAATTCACAAGCGCACAGGAATTTTCTTGAAGATGGA 
GAAAGTGATGGCTTTTTAAGATGCCTCTCTCTTAACTCTGGGTGGATTTTAACTACAACTCT 
TGTCCTCTCGGTGATGGTATTGCTTTGGATTTGTTGTGCAACTGTTGCTACAGCTGTGGAGC 
AGTATGTTCCCTCTGAGAAGCTGAGTATCTATGGTGACTTGGAGTTTATGAATGAACAAAAG 
CTAAACAGATATCCAGCTTCTTCTCTTGTGGTTGTTAGATCTAAAACTGAAGATCATGAAGA 
AGCAGGGCCTCTACCTACAAAAGTGAATCTTGCTCATTCTGAAATTTAAGCATTTTTCTTTT 
AAAAGACAAGTGTAATAGACATCTAAAATTCCACTCCTCATAGAGCTTTTAAAATGGTTTCA 
TTGGATATAGGCCTTAAGAAATCACTATAAAATGCAAATAAAGTTACTCAAATCTGTG 



FIGURE 132 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA26847 
<subunit 1 of 1, 323 aa, 1 stop 
<MW: 36223, pi: 5.06, NX(S/T): 1 

MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHTYP 
KEEELYACQRGCRLFSICQFVDDGIDLNRTKLECESACTEAYSQSDEQYACHLGCQNQLPFA 
ELRQEQLMSLMPKMHLLFPLTLVRSFWSDMMDSAQSFITSSWTFYLQADDGKIVIFQSKPEI 
QYAPHLEQEPTNLRESSLSKMSYLQMRNSQAHRNFLEDGESDGFLRCLSLNSGWILTTTLVL 
SYMVLLWI CCATVATAVEQYVPSEKLS I YGDLEFMNEQKLNRYPASSLVWRSKTEDHEEAG 
PLPTKVNLAHSEI 

Important features: 
Signal peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 241-260 

N-glycosylation site. 

amino acids 90-93 
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TTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACACCTACCC 
TAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTCAGTTTG 
TGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACAGAAGCA 
TATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGC 
TGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTCCTCTAA 
CTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGC 



FIGURE 134 



CACACTGGCCGGATCTTTTAGAGTCCTTTGACCTTGACCAAGGGTCNGGAAAACAGCAACAA 
GCTGAGCTGCTGTGACAGAGGGAACAAGATGGCGGCGCCGAAGGGAGCCTTTGGGTGAGGAC 
CCAACTGGGGCTCCCGCCGCTGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCG 
CTTCGGCTGAAGCATTTGACTCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAG 
TTGACCTACCCCTTGCACACCTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTG 
CAGGCTGTTTTCAATTTGTCAGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGG 
AATGTGAATCTGCATGTACAGAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTT 
GGTTGCCAGAATCAGCTGCCATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCC 
AAAAATGCACCTACTCTTTCCTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACT 
CCGC 



FIGURE 135 



GCGAGGTGGCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAGGT 
GGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACCGAGC 
GTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGC AATGG GCCGCGGCTG 
GGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGC 
AGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGT 
ACCTGTGATGTTGAAACCATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACTACAAAA 
ACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCGTGTCCTTTCT 
GGAATGACATCAGCCAGTGTGGAAG.AAGGGACTGTGCTGTCAAACCATGTCAATCTGATGAA 
GTTCCTGATGGAATTAAATCTGCGAGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGA 
AGAATGTGAACAAGCTGAACGACTTGGAGCAGTGGATGAATCTCTGAGTGAGGAAACACAGA 
AGGCTGTTCTTCAGTGGACCAAGCATGATGATTCTTCAGATAACTTCTGTGAAGCTGATGAC 
ATTCAGTCCCCTGAAGCTGAATATGTAGATTTGCTTCTTAATCCTGAGCGCTACACTGGTTA 
CAAGGGACCAGATGCTTGGAAAATATGGAATGTCATCTACGAAGAAAACTGTTTTAAGCCAC 
AGACAATTAAAAGACCTTTAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGAACACT 
TTTTACAGTTGGCTAGAAGGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGG 
CCTACATGCAAGCATTAATGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAG 
AAAAGAAATGGGGACACAACATTACAGAATTTCAACAGCGATTTGATGGAATTTTGACTGAA 
GGAGAAGGTCCAAGAAGGCTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTT 
ATCCAAAGTGTTACCATTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTC 
AGGATGAGGAAAACAAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTG 
CATTTTGATGAGAATTCATTTTTTGCTGGGGATAAAAAAGAAGCACACAAACTAAAGGAGGA 
CTTTCGACTGCATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTC 
GTCTGTGGGGAAAGCTTCAGACTCAGtGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAG 
AAATTGATAGCAAATATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGACAAGA 
AATAGTATCATTATTCAACGCATTTGGAAGAATTTCTACAAGTGTGAAAGAATTAGAAAACT 
TCAGGAACTTGTTACAGAATATTCAT TAAA GAAAACAAGCTGATATGTGCCTGTTTCTGGAC 
AATGGAGGCGAAAGAGTGGAATTTCATTCAAAGGCATAATAGCAATGACAGTCTTAAGCCAA 
ACATTTTATATAAAGTTGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACATTTTT 
AAAAATTGTGTTAAGTCTATGTATAJiTACTACTGTGAGTAAAAGTAATACTTTAATAATGTG 
GTACAAATTTTAAAGTTTAATATTGAATAAAAGGAGGATTATCAAATTAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 13 6 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA53974 
<subunit 1 of 1, 468 aa, 1 stop 
<MW: 54393, pi: 5.63, NX(S/T): 2 

MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYRLF 
PRLQKLLESDYFRYYK^LKRPCPE-WNDISQCGRRDCAVKPCQSDEVPDGIKSASYKYSEEA 
NNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVDLLLNPE 
RYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPLASGQGTSEENTFYSWLEGLCVEKRAFY 
RLISGLHASINVHLSARYLLQETWLEKKWGHNITEFQQRFDGILTEGEGPRRLKNLYFLYLI 
ELRALSKVLPFFERPDFQLFTGNKIQDEENKMLLLEILHEIKSFPLHFDENSFFAGDKKEAH 
KLKEDFRLHFRNISRIMDCVGCFKCRLWGKLQTQGLGTALKILFSEKLIANMPESGPSYEFH 
LTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 280-283 and 384-387 

Amidation site. 

amino acids 94-97 

Glycosaminoglycan attachment site. 

amino acids 20-23 and 223-226 

Aminotransferases class -V pyridoxal -phosphate 

amino acids 216-222 

Interleukin-7 proteins 

amino acids 33 8-343 



mm 



FIGURE 137 



GCTGGAAATATGGATGTCATCTACGAGAAACTGTTTTAAGCCACAGACAATTAAAAGACCTT 
TAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGNACACTTTTTACAGTTGGCTAGAA 
GGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGGCCTACATGCAAGCATTAA 
TGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAGAAAAGAAATGGGGACACA 
ACATTACAGAATTTNAACAGCGATTTGATGGAATTTTGACTGAAGGAGAAGGTCCAAGAAGG 
CTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTTATCCAAAGTGTTACCATT 
CTTNGAGCGCCCAGATTTTCAACTNTTTACTGGAAATAAAATTCAGGATGAGGNAAACAAAA 
TGTTACTTTTGGAAATACTTCATGAAATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCA 
TTTTTTTGCTG 



FIGURE 138 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGTTGGGAGGGGGCAGGATGGGAGGGAA 
AGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGACTTCTCATACTGGACAGAAAC 
CGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTGCCCCTGGTGTTCCTGACAGG 
TCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCTATTCCCAGGGCCACCAGAAG 
CTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGACAGCGATGGATGCTGGTGGGC 
GCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTTTATCGCTGCCCTGTAGGGGG 
GGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTACCAACTGGGAAATTCATCTC 
ATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGACAGATGGTGATGGGGGATTC 
ATGGTGAGCTAAGGAGAGGGTGGTGGCAGTGTCTCTGAAGGTCCATAAAAGAAAAAAGAGAA 
GTGTGGTAAGGGAAAATGGTCTGTGTGGAGGGGTCAAGGAGTTAAAAACCCTAGAAAGCAAA 
AGGTAGGTAATGTCAGGGAGTAGTCTTCATGCCTCCTTCAACTGGGAGCATGTTCTGAGGGT 
GCCCTCCCAAGCCTGGGAGTAACTATTTCCCCCATCCCCAGGCCTGTGCCCCTCTCTGGTCT 
CGTGCTTGTGGCAGCTCTGTCTTCAGTTCTGGGATATGTGCCCGTGTGGATGCTTCATTCCA 
GCCTCAGGGAAGCCTGGCACCCACTGCCCAACGTGAGCCAGAGGAAGGCTGAGTACTTGGTT 
CCCAGAAGGAGATACTGGGTGGGAAAAAGATGGGGCAAAGCGGTATGATGCCTGGCAAAGGG 
CCTGCATGGCTATCCTCATTGCTACCTAATGTGCTTGCAAAAGCTCCATGTTTCCTAACAGA 
TTCAGACTCCTGGCCAGGTGTGGTGGCCCACACCTGTAATTCTAGCACTTTGGGAGGCCAAG 
GTGGGCAGATCACTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACTCCAT 
CTCTACTAAAAAAAAAAAAATACAAAAATTAGCTGGGTGCGCTAGTGCATGCCTGTAATCTC 
ATCTACTCGGGAGGCTAAGACAGGAGACTCTCACTTCAACCCAGGAGGTGGAGGTTGCGGTG 
AGCCAAGATTGTGCCTCTGCACTCTAGCGTGGGTGACAGAGTAAGCGAGACTCCATCTCAAA 
AATAATAATAATAATAATTCAGACTCCTTATCAGGAGTCCATGATCTGGCCTGGCACAGTAA 
CTCATGCCTGTAATCCCAACATTTTGGGAGGCCAACGCAGGAGGATTGCTTGAGGTCTGGAG 
GTTTGAGACCAGCCTGGGCAACATAGAAAGACCCCATCTCTAAATAAATGTTTTAAAAAT 



IT 



FIGURE 139 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA57039 
xsubunit 1 of 1, 124 aa, 1 stop 
><MW: 13352, pi: 5.99, NX{S/T): 1 

MELPFVTHLFLPLVFLTGLCSPFNLDEHHPRLFPGPPEAEFGYSVLQHVGGGQRWMLVGAPW 
DGPSGDRRGDVYRCPVGGAHNAPCAKGHLGDYQLGNSSHPAVNMHLGMSLLETDGDGGFMVS 

Important features: 
Signal peptide: 

amino acids 1-22 

Cell attachment sequence. 

amino acids 70-73 

N-glycosylation site. 

amino acids 98-101 



Integrins alpha chain proteins 

amino acids 67-81 



FIGURE 140 



CACAGTTCCCCACCATCACTCNTCCCATTCCTTCCAACTTTATTTTTAGCTTGCCATTGGGA 
GGGGGCAGGATGGGAGGGAAAGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGAC 
TTCTCATACTGGACAGAAACCGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTG 
CCCCTGGTGTTCCTGACAGGTCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCT 
ATTCCCAGGGCCACCAGAAGCTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGAC 
AGCGATGGATGCTGGTGGGCGCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTT 
TATCGCTGCCCTGTAGGGGGGGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTA 
CCAACTGGGAAATTCATCTCATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGA 
CAGATGGTGATGG 



FIGURE 141 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 
GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 
ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 
GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 
AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 
ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 
TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 
AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 
GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 
ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 
GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 
TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 
CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 
GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA 
CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 
GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 
GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 
TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 
GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 
CTCATAGGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 
ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 
GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 
TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 
CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 
TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 
TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 
TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 
GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG 
TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 
AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 
CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 
GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 
TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 
AAAAAAAAAA 



FIGURE 142 



</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA57033 
<subunit 1 of 1, 311 aa, 1 stop 
<MW: 35076, pi: 5.04, NX(S/T): 2 

MQTFTMVLEE I WTSLFMWFFYAL I PCLLTDE VAI LPAPQNLSVLSTNMKHLLMWS PVIAPGE 
TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW 
SILKHPFNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSG 
GI PVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAI PLVLAL FAFVGFML I LV 
WPLFVWKMGRLLQYS CC PVWLPDTLKITNS PQKL I SCRREE VDACATAVMS PEELLRAWI S 

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation site. 

amino acids 40-43 and 134-137 

Tissue factor proteins. 

amino acids 92-119 

Integrins alpha chain proteins 

amino acids 232-262 



FIGURE 14 



TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGl'ACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCC 
ACATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 



FIGURE 144 



CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGA 
GGCAGGAGAGGAGGAGGTGGGGAGA.GCACGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGA 
GGAGGTGAAGAAGGAGAGAGGGGAGAAGAGGCAGGAGCTGGAAAGGAGAGAGGGAGGAGGAG 
GAGGAGATGCGGGATGGAGACCTGGAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAAC 
GGAGAGGAGGTGTGGGTTAGGAACC.AAGAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGGA 
GTAGGAAGATCAGGAGCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGCAGAGGAAAGAGG 
AAAGACACAGAGAGACGGGAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCAGTCCCTG 
GCTGCTTTGGCATTTGGGGAACTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCT 
GGAGGGACAGGGTCCCAGAAGGAGGGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGG 
CAGGGGTCCCTCGGAGGCCTCCTGGGGATGGGGGCTGCAGCTCGTCTGAGCGCCCCTCGAGC 
GCTGGTACTCTGGGCTGCACTGGGGGCAGCAGCTCACATCGGACCAGCACCTGACCCCGAGG 
ACTGGTGGAGCTACAAGGATAATCTCCAGGGAAACTTCGTGCCAGGGCCTCCTTTCTGGGGC 
CTGGTGAATGCAGCGTGGAGTCTGTGTGCTGTGGGGAAGCGGCAGAGCCCCGTGGATGTGGA 
GCTGAAGAGGGTTCTTTATGACCCCTTTCTGCCCCCATTAAGGCTCAGCACTGGAGGAGAGA 
AGCTCCGGGGAACCTTGTACAACACCGGCCGACATGTCTCCTTCCTGCCTGCACCCCGACCT 
GTGGTCAATGTGTCTGGAGGTCCCCTCCTTTACAGCCACCGACTCAGTGAACTGCGGCTGCT 
GTTTGGAGCTCGCGACGGAGCCGGCTCGGAACATCAGATCAACCACCAGGGCTTCTCTGCTG 
AGGTGCAGCTCATTCACTTCAACCAGGAACTCTACGGGAATTTCAGCGCTGCCTCCCGCGGC 
CCCAATGGCCTGGCCATTCTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAACCCATTCCT 
CAGTCGCCTCCTTAACCGCGACACCATCACTCGCATCTCCTACAAGAATGATGCCTACTTTC 
TTCAAGACCTGAGCCTGGAGCTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGGGC 
TCTCTCAGCACCCCGCCCTGCTCCGAGACTGTCACCTGGATCCTCATTGACCGGGCCCTCAA 
TATCACCTCCCTTCAGATGCACTCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCT 
TCCAGAGCCTCAGCGGTAACAGCCGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGC 
AACAGGGACCCCCGGCACCCCGAGAGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGA 
TGGTGTCCCCCATGGTCGCTGAGACTCCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTC 
CCCACAAGGCGAGGGGAGTTACCCCT'AAAACAAAGCTATTAAAGGGACAGAATACTTA 



FIGURE 145 



< / usr / s eqdb2 / s s t/DNA/ Dnas eqs . min / s s . DNA3 4353 
<subunit 1 of 1, 328 aa, 1 stop 
<MW: 36238, pi: 9.90, NX(S/T): 3 

MGAAARLSAPRALVLWAALGAAAH3IGPAPDPEDWWSYKDNLQGNFVPGPPFWGLVNAAWSLC 
AVGKRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVSFLPAPRPVVNVSGGPL 
LYSHRLSELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSL 
FWVASTSNPFLSRLLNRDTITRISYKNDAYFLQDLSLELLFPESFGFITYQGSLSTPPCSE 
TVTWILIDRALNITSLQMHSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPER 
RCRGPNYRLHVDGVPHGR 

Important features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 177-199 

N-glycosylation site. 

amino acids 118-121, 170-173 and 260-263 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 222-270, 128-164 and 45-92 



FIGURE 146 



GGCGCCTGGTTCTGCGCGTACTGGCTGTACGGAGCAGGAGCAAGAGGTCGCCGCCAGCCTCCGCCGCCGAGCCTC 
GTTCGTGTCCCCGCCCCTCGCTCCTGCAGCTACTGCTCAGAAA.CGCTGGGGCGCCCACCCTGGCAGACTAACGAA 
GCAGCTCCCTTCCCACCCCAACTGCAGGTCTAATTTTGGAaSCTTTGCCTGCCATTTCTTCCAGGTTGAGGGAGC 
CGCAGAGGCGGAGGCTCGCGTATTCCTGC^GTCAGC^CCC^CGTCGCCCCCGGACGCTCGGTGCTC^GGCCCTTC 
GCGAGCGGGGCTCTCCGTCTGCGGTCCCTTGTGAAGGCTCTGGGCGGCTGCAGAGGCCGGCCGTCCGGTTTGGCT 
CACCTCTCCCAGGAAACTTCACACTGGAGAGCCAAAAGGAGTGGAAGAGCCTGTCTTGGAGATTTTCCTGGGGAA 
ATCCTGAGGTC^TTCATT ATGA AGTGTACCGCGCGGGAGTGGCTCAGAGTAACCAC^GTGCTGTTr!ATGGr!TAGA 
GCAATTCCAGCCATGGTGGTTCCCAATGCCA.CTTTATTGGAGAAACTTTTGGAAAAATACATGGATGAGGATGGT 
GAGTGGTGGATAGCCAAACAACGAGGGAAAAGGGCCATCACAGACAA 

AATAAATTACGAAGTCAGGTGTATCCAACAGCCTCTAATATGGAGTATATGACATGGGATGTAGAGCTGGAAAGA 

TCTGCAGAATCCTGGGCTGAAAGTTGCTTGTGGGAACATGGACCTGCAAGCTTGCTTCCATCAATTGGACAGAAT 

TTGGGAGCACACTGGGGAAGATATAGGCCCCCG&CGTT^ 

AGCTACCCATATGAACATGAATGCAACCCATATTGTCCM 

CAGGTCGTGTGGGCAACTAGTAACAGAATCGGTTGTGCC^ 

ATATGGCCCAAAGCTGTCTACCTGGTGTGCAATTACTCCCCAAAGGGAAACTGGTGGGGCCATGCCCCTTACAAA 
CATGGGCGGCCCTGTTCTGCTTGCCGACCTAGTTTTGGAGGGGGCTGTAGAGAAAATCTGTGCTACAAAGAAGGG 
TCAGACAGGTATTATCCCCCTCGAGAAGAGGAAACAAATG 

TGTGAAGTAAGATTAAGAGATCAGTGCAAAGGAACAACCTGCAATAGGTACGAATGTCCTGCTGGCTGTTTGGAT 
AGTAAAGCTAAAGTTATTGGCAGTGTACATTATGAAATGCAATCCAGCATCTGTAGAGCTGCAATTCATTATGGT 
ATAATAGACAATGATGGTGGCTGGGTAGATATCACTAGACAAGGAAGAAAGCATTATTTCATCAAGTCCAATAGA 
AATGGTATTCAAACAATTGGCAAATATCAGTCTGCTAATTCCTTCACAGTCTCTAAAGTAAGAGTTCAGGCTGTG 
ACTTGTGAAACffiACTGTGGAACAGCTCTGTCCATTTC^ 

CGTAACTGTATGCAAGCAAATCCACATTATGCTCGTGTAATTGGAACTCGAGTTTATTCTGATCTGTCCAGTATC 
TGCAGAGCAGCAGTACATGCTGGAGTGGTTCGAAATCACGGTGGTTATGTTGATGTAATGCCTGTGGACAAAA.GA 
AAGACCTACATTGCTTCTTTTCAGAATGGAi\TCTTCTCAGAAAGTTTACAGAATCCTCCAGGAGGAAAGGCATTC 
AGAGTGTTTGCTGTTGTGjraAAACTGAATACTTGGAAGAGGACCATAAAGACTATTCCAAATGCAATATTTCTGA 
ATTTTGTATAAAACTGTAACATTACTGTACAGAGTACAT^ 

TAAATCTTGATAAACAAAGTCTATAAAATAAAACATGGGACATTAGCTTTGGGAAAAGTAATGAAAATATAATGG 
TTTTAGAAATCCTGTGTTAAATATTGCTATATT^ 

GTTCTACGTTTCATATATTATATGGTGCTTTGTATATGCCACTAATAAAATGAATCTAAACATTGAATGTGAATG 

GCCCTCAGAAAATCATCTAGTGCATTTAAAAATAA 

AGTTC^^TGCTATGCCATTACCAACTCCAAATAATCT 

TGTTAATTTAGGCATATAGAATATTAAATTCTGATATTGCACTTCTTATTTTATATAAAATAATCCTTTAATATC 
CAAATGAATCTGTTAAAATGTTTGATTCCTTGGGAATGGCCTTAAAAATAAATGTAATAAAGTCAGAGTGGTGGT 
ATGAAAACATTCCTAGTGATCATGTAGTAAATGTAGGGTTAAGCATGGACAGCCAGAGCTTTCTATGTACTGTTA 
AAATTGAGGTC^CATATTTTCTTTTGTATCCTGGC^^ 

GAACAAAGATGAACTAATGTATTACATTAC(^TTGCCACTGATTTTTTTTAAATGGTAAATGACCTTGTATATAA 
ATATTGCCATATCATGGTACCTATAATGGTGATATATTTGTTTCTATGAAAAATGTATTGTGCTOTGATACTAAA 
AATCTGTAAAATGTTAGTTTTGGTAATTTTTTTTCTGCTGGTGGATTTACATATTAAATTTTTTCTGCTGGTGGA 
TAAAC^TTAAAATTAATCATGTTTCAAAAAilAAAAAAA 



FIGURE 147 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45417 
<subunit 1 of 1, 500 aa, 1 stop 
<MW: 56888, pi: 8.53, NX(S/T): 2 

MKCTAREWLRVTTVLFMARAIPAMVVPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITDNDM 
QS I LDLHNKLRS QVY PTASNME YMTWDVELERSAES WAES CLWEHGPAS LLPS IGQNLGAHW 
GRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLC 
HNMNIWGQIWPKAWLVCNYSPKGlSr^GHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRY 
YPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVISAQQMSQIVSCEVRLRDQCKGTTCNR 
YECPAGCLDSKAKVIGSVHYEMQSS I CRAAIHYGI IDNDGGWVDITRQGRKHYFI KSNRNGI 
QTIGKYQSANSFTVSKVTVQAVTCETTVEQLCPFHKPASHCPRVYCPRNCMQANPHYARVIG 
TRVYSDLSSICRAAVHAGWRNHGGYVDVMPVDKRKTYIASFQNGIFSESLQNPPGGKAFRV 
FAW 

Important features: 
Signal peptide: 

amino acids 1-20 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 protein 

amino acids 165-186, 196-218, 134-146, 96-108 and 58-77 

N-glycosylation site 

amino acids 28-31 



FIGURE 148 



GCGGAGACAAGCGCAGAGCGCAGCGJCACGGCCACAGACAGCCCTGGGCATCCACCGACGGCG 
CAGCCGGAGCCAGCAGAGCCGGAAC5GCGCGCCCCGGGCAGAGAAAGCCGAGCAGAGCTGGGT 
GGCGTCTCCGGGCCGCCGCTCCGACGGGCCAGCGCCCTCCCCATGTCCCTGCTCCCACGCCG 
CGCCCCTCCGGTCAGCATGAGGCTCICTGGCGGCCGCGCTGCTCCTGCTGCTGCTGGCGCTGT 
ACACCGCGCGTGTGGACGGGTCCAAATGCAAGTGCTCCCGGAAGGGACCCAAGATCCGCTAC 
AGCGACGTGAAGAAGCTGGAAATGAAGCCAAAGTACCCGCACTGCGAGGAGAAGATGGTTAT 
CATCACCACCAAGAGCGTGTCCAGGTACCGAGGTCAGGAGCACTGCCTGCACCCCAAGCTGC 
AGAGCACCAAGCGCTTCATCAAGTGGTACAACGCCTGGAACGAGAAGCGCAGGGTCTACGAA 
GAATAGGGTGAAAAACCTCAGAAGGGAAAACTCCAAACCAGTTGGGAGACTTGTGCAAAGGA 
CTTTG<^GATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCCTTTC 
TTTCTCACAGGCATAAGACACAAATTATATATTGTTATGAAGCACTTTTTACCAACGGTCAG 
TTTTTACATTTTATAGCTGCGTGCGAAAGGCTTCCAGATGGGAGACCCATCTCTCTTGTGCT 
CCAGACTTCATCACAGGCTGCTTTTTATCAAAAAGGGGAAAACTCATGCCTTTCCTTTTTAA 
AAAATGCTTTTTTGTATTTGTCCATACGTCACTATACATCTGAGCTTTATAAGCGCCCGGGA 
GGAACAATGAGCTTGGTGGACACATTTCATTGCAGTGTTGCTCCATTCCTAGCTTGGGAAGC 
TTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTCTCCTGGGCTTATGGCCG 
GTCACAGCCTCAGTGTGACTCCACAGTGGCCCCTGTAGCCGGGCAAGCAGGAGCAGGTCTCT 
CTGCATCTGTTCTCTGAGGAACTCAAGTTTGGTTGCCAGAAAAATGTGCTTCATTCCCCCCT 
GGTTAATTTTTACACACCCTAGGAAACATTTCCAAGATCCTGTGATGGCGAGACAAATGATC 
CTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCACAGGTTCAATA 
TTTAATGCTTCAGAAGCATGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAGAAAACT 
TAAAAATATATGAATACATGCGCAATACACAGCTACAGACACAC^TTCTGTTGACAAGGGAA 
AACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACTAAAGCAATATAT 
TTGTGATTCCCCATGTAATTCTTCAATGTTAAACAGTGCAGTCCTCTTTCGAAAGCTAAGAT 
GACCATGCGCCCTTTCCTCTGTACATATACCCTTAAGAACGCCCCCTCCACACACTGCCCCC 
CAGTATATGCCGCATTGTACTGCTGTGTTATATGCTATGTACATGTCAGAAACCATTAGCAT 
TGCATGCAGGTTTCATATTCTTTCTiiAGATGGAAAGTAATAAAATATATTTGAAATGTAAAA 
AAAAAAAAAAA 



FIGURE 149 



MSLLPIUUlPPVSMRLLAAALLLLLIjALYTiyiVDGSKCKCSRKGPKIRYSDVKKLEMKPKYPH 
CEEKMVIITTKSVSRYRGQEHCLHPKLQSTICRFIKWYNAWNEKRRVYEE 

Signal sequence: 

amino acids 1-34 



FIGURE 150 



GCCCCAGGGACTGCTATGGCTTCCTTTGTTGTTCACCCCQGTCTGCGTCATGTTJUl&CTCCAATGTCCTCCTGTG 

GTTAACTGCTCTTGCCATC^GTTCACCCTC^TTGACAGCCAA^^ 

CIAAAATCCGGGGCCTAAGAACACCGTTACCCAATGAGATCT 

TGCCTCACCCCCCACTGGAGAGAGGCGGTTTCAGCCCCCAGAACCCCCGTCCTCCTGGACTGGCATCCGAAATAC 
TACTCmGTTTGCTGCTGTGTGCCCCCAGCaCCTGGATGAGAGATCCTTACTGCATGACATGCTGCCCATCTGGTT 
TACCGCCAATTTGGATACTTTG&TGACCTATGTTCAAGATC^^ 

GCCCACGGAAGATGGAGCCAACAC&AAGAAAAACGCAGATGATATAACGAGTAATGAC^ 

TATTCATGATCAGAACAGTAAGAAGCCCGTCATGGTCTATATCCATGGGGGATCTTACATGGAGGGCaCCGGCAA 

CATGATTGACGGCAGCATTTTGGCAAGCTACGGAAACGTCATCGTGATCACCATTAACTACCGTCTGGGAATACT 

AGGGTTTTTAAGTACCGGTGACCAGGCAGCAAAAGGCAACTATGGGCTCCTGGATC^GATTCAAGCACTGCGGTG 

GATTGAGGAGAATGTGGGAGCCTTTGGCGGGGACCCCAAGAGAGTGACCATCTTTGGCTCGGGGGCTGGGGCCTC 

CTGTGTCAGCCTGTTGACCCTGTCCCACTACTCAGAAGGTCTCTTCCAGAAGGCCATCATTCAGAGCGGC^VCCGC 

CCTGTCCAGCTGGGC^GTGAACTACCAGCCGGCCAAGTACACTCGGATATTGGCAGACAAGGTCGGCTGCAACAT 

GCTGGACACCACGGAC^TGGTAGAATGCCTGCGGAACAAGAACTACAAGGAGCTCATCCAG 

GGCCACCTACCACATAGCCTTCGGGCCGGTC3ATCGACGGCGACGTCATCCCAGACGACCCCCAGATCCTGATGGA 

GGAAGGCGAGTTCCTC^ACTACGACATCATGCTGGGCGTCAACCAAGGGGAAGGCCTGAAGTTCGTGGACGGCAT 

CGTGGATAACGAGGACGGTGTGACGCCCAACGACTlTGACTTCTCCGTGTCC^CTTCGTGGAC^CCTTTAaSG 

CTACCCTGAAGGGAAAGACACTTTGCGGGAC3ACTATCAAGTTCATGTACACAGACTGGGCCGATAAGGAAAACCC 

GGAGACGCGGCGGAAAACCCTGGTGGCTCTCTTTACTGACCACCAGTGGGTGGCCCCCGCCGTGGCCGCCGACCT 

GCACGCGCAGTACGGCTCCCCCACCTACTTCTATGCCTTCTATCATCACTGCCIAAAGCGAAATGAAGCCCAGCTG 

GGCAGATTCGGCCCATGGTGATGAGGTCCCCTATGTCTTCGGCATCCCCATGATCGGTCCCACCGAGCTCTTCAG 

TTGTAACTTTTCCAAGAACGACGTCATGCTCAGCGCCGTGGTCATGACCTACTGGACGAACTTCGCCAAAACTGG 

TGATCCAAATCAACCAGTTCCTCAGGATACC'AAGTTCATTCACACAAAACCCAACCGCTTTGAAGAAGTGGCCTG 

GTCCAAGTATAATCCCAAAGACCAGCTCTATCTGCATATTGGCTTGAAACCCAGAGTGAGAGATCACTACCGGGC 

AACGAAAGTGGCTTTCTGGTTGGAACTCGTTCCTCATTTGCACAACTTGAACGAGATATTCCAGTATGTTTCAAC 

AACCACAAAGGTTCCTCCACCAGACATGACTTCATTTCCCTATGGCACCCGGCGATCTCCCGCCAAGATATGGCC 

AACCACCAAACGCCCAGCAATCACTCCTGCCAACAATCCCAAA.CACTCTAAGGACCCTCACAAAACAGGGCCTGA 

GGACACAACTGTCCTCATTGAAACCAAACGAGATTATTCCACCGAATTAAGTGTCACCATTGCCGTCGGGGCGTC 

GCTCCTCTTCCTC^CATCTTAGCTTTTGCGGCGCTC^ 

GCGCCCCAGTCCCC^GAGAAACACCACAAATGATATCGCTCACATCCAGAACGAAGAGATCATGTCTCTGCAGAT 
GAAGCAGCTGGAACACGATCACGAGTGTGAGTCGCTGCAGGCACACGACACACTGAGGCTCACCTGCCCGCCAGA 
CTACACCCTCACGCTGCGCCGGTCGCCAGATGACATCCCACTTATGACGCCAAACACCATCACCATGATTCCAAA 
CACACTGACGGGGATGCAGCCTTTGCACACTTTTAACACCTTCAGTGGAGGACAAAACAGTACAAATTTACCCCA 
CGGACATTCCACCACTAGAGT ATAGC TTTGCCCTATTTCCCTTCCTATCCCTCTGCCCTACCCGCTCAGCAACAT 
AGAAGAGGGAAGGAAAGAGAGAAGGAAAGAGAGAGAGAAAGAAAGTCTCCAGACCAGGAATGTTTTTGTCCCACT 
GACTTAAGACAAAAATGCAAAAAGGCAGTCATCCCATCCCGGCAGACCCTTATCGTTGGTGTTTTCCAGTATTAC 
AAGATC^CTTCTGACCCTGTGAAATGTGAGAAGTACACATTTCTGTTAAAATAACTGCTTTAAGATCTCTACCA 
CTCCAATCAATGTTTAGTGTGATAGGACATGACCATTTCAAGGCCCCGGGTGTTTCCAACGTCATGGAAGCAGCT 
GACACTTCTGAAACTCAGCCy^GGACACTTGATATTTTTTAATTACAATGGAAGTTTAAACATTTCTTTCTGTGC 
CACACAATGGATGGCTCTCCTTAAGTGAAGAAAGAGTCAATGAGATTTTGCCCAGCACATGGAGCTGTAATCCAG 
AGAGAAGGAAACGTAGAAATTTATTATTAAAAGAATGGACTGTGCAGCGAAATCTGTACGGTTCTGTGCAAAGAG 
GTGTTTTGCCAGCCTGAACTATATTTAAGAGACTTTGT 



FIGURE 151 

MLNS3WLLWLTALAIKFTLIDSQAQYPVVNTNYGKIRGLRTPLPNEILGPVEQYLGVPYASP 
PTGERRFQPPEPPSSWTGIRNTTQFAAVCPQHLDERSLLHDMLPIWFTANLDTLMTYVQDQN 
EDCLYLNIYVPTEDGANTKKNADDITSNDRGEDEDIHDQNSKKPVMVYIHGGSYMEGTGNMI 
DGS I LAS YGNVI VI T INYRLG I LGFLSTGDQAAKGNYGLLDQ I QALRWI EENVGAFGGDPKR 
VT I FGSGAGAS CVS LLTLSHYSEGLFQKAI I QSGTALS S WAVNYQPAKYTR I LADKVGCNML 
DTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLNYDIMLGV 
NQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIKFMYTDWADKENP 
ETRRKTLVALFTDHQWVAPAVAADLI-IAQYGSPTYFYAFYHHCQSEMKPSWADSAHGDEVPYV 
FGIPMIGPTELFSCNFSKNDVMLSAWMTYWTNFAKTGDPNQPVPQDTKFIHTKPNRFEEVA 
WSKYNPKDQLYLHIGLKPRVRDHYRi^TKVAFWLELVPHLHNLNEIFQYVSTTTKVPPPDMTS 
FPYGTRRSPAKIWPTTKRPAITPANNPKHSKDPHKTGPEDTTVLIETKRDYSTELSVTIAVG 
ASLLFLNILAFAALYYKKDKRRHETHRRPSPQRNTTNDIAHIQNEEIMSLQMKQLEHDHECE 
SLQAHDTLRLTCPPDYTLTLRRS PDD I PLMTPNTI TMI PNTLTGMQPLHTFNTFSGGQNSTN 
LPHGHSTTRV 

Signal sequence: 

amino acids 1-24 



Transmembrane domains : 

amino acids 189-204, 675-692 



FIGURE 152 



GGGAAAGATGGCGGCGACTCTGGGACCCCTTGGGTCGTGGCAGCAGTGGCGGCGATGTTTGT 
CGGCTCGGGATGGGTCCAGGATGTTACTCCTTCTTCTTTTGTTGGGGTCTGGGCAGGGGCCA 
CAGCAAGTCGGGGCGGGTCAAACGTTCGAGTACTTGAAACGGGAGCACTCGCTGTCGAAGCC 
CTACCAGGGTGTGGGCACAGGCAGTTCCTCACTGTGGAATCTGATGGGCAATGCCATGGTGA 
TGACCCAGTATATCCGCCTTACCCCAGATATGCAAAGTAAACAGGGTGCCTTGTGGAACCGG 
GTGCCATGTTTCCTGAGAGACTGGGAGTTGCAGGTGCACTTCAAAATCCATGGACAAGGAAA 
GAAGAATCTGCATGGGGATGGCTTGGCAATCTGGTACACAAAGGATCGGATGCAGCCAGGGC 
CTGTGTTTGGAAACATGGACAAATTTGTGGGGCTGGGAGTATTTGTAGACACCTACCCCAAT 
GAGGAGAAGCAGCAAGAGCGGGTATTCCCCTACATCTCAGCCATGGTGAACAACGGCTCCCT 
CAGCTATGATCATGAGCGGGATGGGfCGGCCTACAGAGCTGGGAGGCTGCACAGCCATTGTCC 
GCAATCTTCATTACGACACCTTCCTGGTGATTCGCTACGTCAAGAGGCATTTGACGATAATG 
ATGGATATTGATGGCAAGCATGAGTGGAGGGACTGCATTGAAGTGCCCGGAGTCCGCCTGCC 
CCGCGGCTACTACTTCGGCACCTCCTCCATCACTGGGGATCTCTCAGATAATCATGATGTCA 
TTTCCTTGAAGTTGTTTGAACTGACAGTGGAGAGAACCCCAGAAGAGGAAAAGCTCCATCGA 
GATGTGTTCTTGCCCTCAGTGGACAA.TATGAAGCTGCCTGAGATGACAGCTCCACTGCCGCC 
CCTGAGTGGCCTGGCCCTCTTCCTCATCGTCTTTTTCTCCCTGGTGTTTTCTGTATTTGCCA 
TAGTCATTGGTATCATACTCTACAACAAATGGCAGGAACAGAGCCGAAAGCGCTTCTAC TGA 
GCCCTCCTGCTGCCACCACTTTTGTGACTGTCACCCATGAGGTATGGAAGGAGCAGGCACTG 
GCCTGAGCATGCAGCCTGGAGAGTGTTCTTGTCTCTAGCAGCTGGTTGGGGACTATATTCTG 
TCACTGGAGTTTTGAATGCAGGGACCCCGCATTCCCATGGTTGTGCATGGGGACATCTAACT 
CTGGTCTGGGAAGCCACC CAC CCCAGGGCAATGCTGCTGTGATGTGCCTTTCCCTGCAGTCC 
TTCCATGTGGGAGCAGAGGTGTGAAGAGAATTTACGTGGTTGTGATGCCAAAATCACAGAAC 
AGAATTTCATAGCCCAGGCTGCCGTGTTGTTTGACTCAGAAGGCCCTTCTACTTCAGTTTTG 
AATCCACAAAGAATTAAAAACTGGTAACACCACAGGCTTTCTGACCATCCATTCGTTGGGTT 
TTGCATTTGACCCAACCCTCTGCCTACCTGAGGAGCTTTCTTTGGAAACCAGGATGGAAACT 
TCTTCCCTGCCTTACCTTCCTTTCACTCCATTCATTGTCCTCTCTGTGTGCAACCTGAGCTG 
GGAAAGGCATTTGGATGCCTCTCTGTTGGGGCCTGGGGCTGCAGAACACACCTGCGTTTCAC 
TGGCCTTCATTAGGTGGCCCTAGGGAGATGGCTTTCTGCTTTGGATCACTGTTCCCTAGCAT 
GGGTCTTGGGTCTATTGGCATGTCCATGGCCTTCCCAATCAAGTCTCTTCAGGCCCTCAGTG 
AAGTTTGGCTAAAGGTTGGTGTAAAAATCAAGAGAAGCCTGGAAGACATCATGGATGCCATG 
GATTAGCTGTGCAACTGACCAGCTCCAGGTTTGATCAAACCAAAAGCAACATTTGTCATGTG 
GTCTGACCATGTGGAGATGTTTCTGGACTTGCTAGAGCCTGCTTAGCTGCATGTTTTGTAGT 
TACGATTTTTGGAATCCCACTTTGAGTGCTGAAAGTGTAAGGAAGCTTTCTTCTTACACCTT 
GGGCTTGGATATTGCCCAGAGAAGAAATTTGGCTTTTTTTTTCTTAATGGACAAGAGACAGT 
TGCTGTTCTCATGTTCCAAGTCTGAGAGCAACAGACCCTCATCATCTGTGCCTGGAAGAGTT 
CACTGTCATTGAGCAGCACAGCCTGAGTGCTGGCCTCTGTCAACCCTTATTCCACTGCCTTA 
TTTGACAAGGGGTTACATGCTGCTCACCTTACTGCCCTGGGATTAAATCAGTTACAGGCCAG 
AGTCTCCTTGGAGGGCCTGGAACTCTGAGTCCTCCTATGAACCTCTGTAGCCTAAATGAAAT 
TCTTAAAATCACCGATGGAACCAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGTAGGGATAACAGGGTAATAAGCTTGGCCGCCATGG 



></usr/ seqdb2/ sst/DNA/Dnaseqs .min/ss .DNA50911 
xsubunit 1 of .1, 348 aa, 1 stop 
><MW: 39711, pi: 8.70, NX(S/T): 1 

MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKPYQ 

GVGTGS S S LWNLMGNAMVMTQYI RLTPDMQS KQGALWNRVP CFLRDWE LQVHFKI HGQGKKN 

LHGDGIAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQERVFPYISAMV1M 

DHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDIDGKHEWRDCIEVPGVRLPRG 

YYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDVFLPSVDNMKLPEMTAPLPPLS 

GLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY 

Signal sequence: 

amino acids 1-3 8 



Transmembrane domain: 

amino acids 310-329 



FIGURE 154 



CCGAGCCGGGCGCGCAGCGACGGAGCTGGGGCCGGCCTGGGACCATGGGCGTGAGTGCAATCTACGGATCAGTCT 
CTGATGGTGGGTCGTTAACCTCAGTGGGGAGTCCAAGATTTCCATGAAGAAAATCAGTTGTCTTCATTC^GAAT 
TGGGGTCTGGCTCAGAATTCCTGCAGCTGGTGAAAATCTGTTTTCTAGAAGAGGTTTAATTAATGCCTGCAGTCT 
GACATGTTCCCGATTTGAGGTGAAACCATGAAGAGAAAATAGAATACTTAATAMGCTTTTCCGCBACCGCTTCT 
TGCTGCTGCTGGCCCTGGCTGCGCTGCTGGCCTTTGTGAGCCTCAGCCTGCAGTTCTTCCACCTGATCCCGGTGT 
CGACTCCTAAGAATGGAATGAGTAGCAAGAGTCGAAAGAGAATCATGCCCGACCCTGTGACGGAGCCCCCTGTGA 
CAGACCCCGTTTATGAAGCTCTTTTGTACTGCAACATCCCCAGTGTGGCCGAGCGC^GCATGGAAGGTCATGCCC 
CGCATCATTTTAAGCTGGTCTCAGTGCATGTGTTCA^ 

aaacaaagcgaccagaaattgactgcactc:tggtggctaac^ggaaaccgtatcacccaaaactggaagctttca 
ttagtcacatgtcaaaaggatccggagcctctttc^^ 

cattgtgtgagatgggagagctcacacagacaggagttgtgcagcatttgcagaacggtcagctgctgagggata 

TCTATCTAAAGAAACACAAACTCCTGCCCAATGATT^ 

GCCGGACCCTAO^GTGGGOTGGCCTTGCTTTATGGCTTTCTCCCAGATTTTGACTGGAAGAAGATTTATTTCA 
GGCACCAGCQ^GTGCGCTGTTCTGCTCTGGAAGCTGCTATTGCCCGGTAAGAAACCAGTATCTGGAAAAGGAGC 
AGCGTCGTCAGTACCTCCTACGTTTGAAAAAC^GCCAGCTGGAGAAGACCTACGGGGAGATGGCCAAGA 
ATGTCCCCACC^GCAGCTTAGAGCTGCCA^^ 

TTCCCTGTACCAGAAATGGCTGTGTTGACATGGAGCACTTCAAGGTAATTAAGACCCATCAGATCGAGGATGAAA 
GGGAAAGACGGGAGAAGAAATTGTACTTCGGGTATTCTCTCCTGGGTGCCCACCCCATCCTGAACCAAACCA.TCG 
GCCGGATGCAGCGTGCCACCGAGGGCAGGAAAGAAGAGCTCTTTGCCCTCTACTCTGCTCATGATGTCACTCTGT 
CACCAGTTCTCAGTGCCTTGGGCCTTTCAGAAGCCAGGT^ 

ggcaagacagagaaaagcccagtgaacattccgtccggattctttacaatggcgtcgatgtcacattccacacct 
ctttctgcc^gacc^cc^c^gcgttctccc^gccc^tgtgcccgcttgaaaacttggtccgctttgtgaaaa 
gggac^tgtttgtagccctgggtggcagtggtacaaattattatgatgcatgtcacagggaaggatt ctaaa agg 
tatgcagtacagcagtatagaatcc^tgccaatacagagaitagggaaaggtccacttctagttttgtctgttac 
taagggtagaagattattgctttttaaaggctaaatattgtttgtgggaaccacagatggttggggttgaacagt 
aagcacattgctgcaatgtggtacgtgaattgcttggtacaaaatggccagttcacagaggaatagaaggtactt 
tatcatagccagacttcgcttagaatgccagaataatat^ 

tcttctggcctgccccatgttactatgtgatggaaccagcacacctcaaccaaaatttttttaatcttagacatt 
tttaccttgtccttgttaagaatttcttgaagtgatttatctaaaataaaggttggcaaactttttctgtaaagg 
gccagattgtaaatatttcagactgtgtgg&ccaaa 

TTCTGAAGCAGGAAAGCCACCACAGACAGTACATAAAGGAATATGTGTAGCTGGGTTCCCAGGCCAGACAAA^ 

GATGGTGACCAGACTTGGCCCCTGGGCTGTAGTTTGCTGACCCCTCATCTAAAAAATAGGCTATACTACAATTGC 

ACTTCCAGCACTTTGAGAACGAGTTGAATACCAAGA 

CAGAATTTGGTCTGTATCTGACACTAGAAG?^AAACTTGAGGGTAAATAAACATTGAATTAGAATGAATCATAGAA 
AACTGATTAGAAGAATACTTGATGTTTATGATGATTGTGGTACAAGATAGTTTTAAGTATGTTCTAAATATTTGT 
CTGCTGTAGTCTATTTGCTGTATATGCTGA^ATTTTTGTATGCCATTTAGTATTTTTATAGTTTAGGAAAATATT 
TTCTAAGACCAGTTTTAGATGACTCTTATTCCTGTAGTAATATTCAATTTGCTGTACCTGCTTGGTGGTTAGAAG 
GAGGCTAGAAGATGAATTCAGGCACTTTCTTCCAATAAAACTAATTATGGCTCATTCCCTTTGACAAGCTGTAGA 
ACTGGATTCATTTTTAAACCATTTTCATCAGTTTCAAATGGTAAATTCTGATTGATTTTTAAATGCGTTTTTGGA 

AACTTTTTCTCTTTGTTTTTGTCCAGTGTTGCATTTGAATATGTCTGTTTCTATAAATAAATTTTTTAAGAATAA 



FIGURE 155 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48329 
xsubunit 1 of 1, 480 aa, 1 stop 
><MW: 55240, pi: 9.30, NX{S/T): 2 

MLFRNRFLLLLALAALLAFVS LSLQFFHL I PVSTPKNGMS SKSRKRIMPDPVTEPPVTDPVY 
EALLYCNI PSVAERSMEGHAPHHFKLVSVHVFIRHGDRYPLYVI PKTKRPE IDCTLVANRKP 
YHPKLEAFISHMSKGSGASFESPLNSLPLYPNHPLCEMGELTQTGWQHLQNGQLLRDIYLK 
KHKLLPNDWSADQLYLETTGKSRTLQSGLALLYGFLPDFDWKKIYFRHQPSALFCSGSCYCP 
VRNQYLEKEQRRQYLLRLKNSQLEKTYGEMAKIVDVPTKQLRAANPIDSMLCHFCHNVSFPC 
TRNGCVDMEHFKVIKTHQIEDERERREKKLYFGYSLLGAHPILNQTIGRMQRATEGRKEELF 
ALYSAHDVTLSPVLSALGLSEARFPRFAARLI FELWQDREKPSEHSVRI LYNGVDVTFHTSF 
CQDHHKRSPKPMCPLENLVRFVKRDMFVALGGSGTNYYDACHREGF 



Signal sequence: 

amino acids 1-18 



FIGURE 156 

AAAAAAGCTCACTAAAGTTTCTATTAGAGCGAATACGGTAGATTTCCATCCCCTTTTGAAGAACAGTACTGTGGA 
GCTATTTAAGAGATAAAAACGAAATATCCTTTCTGGGAGTTCAAGATTGTGCAGTAATTGGTTAGGACTCTGAGC 
GCCGCTGTTC^CC^TCGGGGAGAGAAAAGCGGAGATCCTGCTCGCCTTGCaCGCGCCTGAAGCACAAAGC^GAT 
AGCTAGGAATGAACCATCCCTGGGAGTATGTGGAAACAACGGAGGAGCTCTGACTTCCCAACTGTCCCATTCTAT 
GGGCGAAGGAACTGCTCCTGACTTCAGTGGTTAAGGGCAGAATTGAAAATAATTCTGGAGGAAGATAAGAATGAT 
TCCTGCGCGACTGCACCGGGACTACAAAGGGCTTGTCCTGCTGGGAATCCTCCTGGGGACTCTGTGGGAGACCGG 
ATGCACCCAGATACGCTATTC^GTTCCGGAkGAGCTGGAGAAAGGCT 

GGGGCTGGAGCCCCGGGAGCTCGCGGAGCGCGGAGTCCGCATCATCCCCAGAGGTAGGACGCAGCTTTTCGCCCT 
GAATCCGCGCAGCGGCAGCTTGGTCACGGCGGGCAGGATAGACCGGGAGGAGCTCTGTATGGGGGCCATCAAGTG 
TCAATTAAATCTAGACATTCTGATGGAGGATAAAGTGAAAATATATGGAGTAGAA.GTAGAAGTAAGGGACATTAA 
CGACAATGCGCCTTACTTTCGTGAAAGTGAATTAGAAATAAAAATTAGTGAAAATGCAGCCACTGAGATGCGGTT 
CCCTCTACCCCACGCCTGGGATCCGGATATCGGGAAGAACTCTCTGCAGAGCTACGAGCTCAGCCCGAACACTCA 
CTTCTCCCTCATCGTGCAAAATGGAGCCGACGGTAGTAAGTACCCCGAATTGGTGCTGAAACGCGCCCTGGACCG 
CGAAGAAAAGGCTGCTCACCACOTGGTCCTTACGGCCTCCGACGGGGGCGACCCGGTGCGCACAGGCACCGCGCG 
CATCCGCGTGATGGTTCTGGATGCGAACGACAACGCACCAGCGTTTGCTCAGCCCGAGTACCGCGCGAGCGTTCC 
GGAGAATCTGGCCTTGGGCACGCAGCTGCTTGTAGTCAACGCTACCGACCCTGACGAAGGAGTCAATGCGGAAGT 
GAGGTATTCCTTCCGGTATGTGGACGACAAGGCGGCCCy^GTTTTCAAACTAGATTGTAATTCAGGGACAATATC 
AACAATAGGGGAGTTGGACCACGAGGAGTC^GGATTCTACCAGATGGAAGTGCAAGCAATGGATAATGCAGGATA 
TTCTGCGCGAGCCyiAAGTCCTGATCACTGTTCTGGACGTGAACGACAATGCCCCAGAAGTGGTCCTCACCTCTC^ 
CGCC^GCTCGGTTCCCGAAAACTCTCCCaGAGGGACATTAATTGCCCTTTTAAATGTAAATGACC^AGATTCTGA 
GGAAAACGGACAGGTGATCTGTTTCATCCAAGGAAATCTGCCCTTTAAATTAGAAAAATCTTACGGAAATTACTA 
TAGTTTAGTCACAGACATAGTCTTGGATAG(3GAACAGGTTCCTAGCTACAACATCACAGTGACCGCCACTGACCG 
GGGAACCCCGCCCCTATCCACGGAAACTCATATCTCGCTGAACGTGGCAGAC!ACCAACGACAACCCGCCGGTCTT 
CCCTCAGGCCTCCTATTCCGCTTATATCCGAGAGAACAATCCCAGAGGAGTTTCCCTCGTCTCTGTGACCGCCCA 
CGACCCCGACTGTGAAGAGAACGCCCAGATCACTTATTCCCTGGCTGAGAACACCATCCAAGGGGCAAGCCTATC 
GTCCTACGTGTCCATCAACTCCGACACTGGGGTACTGTATGCGCTGAGCTCCTTCGACTACGAGCAGTTCCGAGA 
CTTGCAAGTGAAAGTGATGGCGCGGGACAACGGGCACCCGCCCCTCAGCAGCAACGTGTCGTTGAGCCTGTTCGT 
GCTGGACCAGAACGACAATGCGCCCGAGATCCTGTACCCCGCCCTCCCCACGGACGGTTCCACTGGCGTGGAGCT 
GGCTCCCCGCTCCGCAGAGCCCGGCTACCTGGTGACCAAGGTGGTGGCGGTGGACAGAGACTCCGGCCAGAACGC 
CTGGCTGTCCTACCGTCTGCTCAAGGCCAGCGAGCCGGGACTCTTCTCGGTGGGTCTGCACACGGGCGAGGTGCG 
CACGGCGCGAGCCCTGCTGGACAGAGACGCGCTC^GCAGAGCCTCGTAGTGGCCGTC<^GGACCACGGCCAGCC 
CCCTCTCTCCGCCACTGTCACGCTCACCGTGGCCGTGGCCGACAGCATCCCCCAAGTCCTGGCGGACCTCGGCAG 
CCTCGAGTCTCCAGCTAACTCTGAAACCTCAGACCTCACTCTGTACCTGGTGGTAGCGGTGGCCGCGGTCTCCTG 
CGTCT'TCCTGGCCTTCGTCATCTTGCTGCTGGCGCTCAGGCTGCGGCGCTGGCACAAGTCACGCCTGCTGCAGGC 
TTCAGGAGGCGGCTTGACAGGAGCGCCGGCGTCGCACTTTGTGGGCGTGGACGGGGTGCAGGCTTTCCTGCAGAC 
CTATTCCCACGAGGTTTCCCTCACCACGGACTCGCGGAAGAGTCACCTGATCTTCCCCCAGCCCAACTATGCAGA 
CATGCTCGTCAGCCAGGAGAGCTTTGAAAAAAGCGAGCCCCTTTTGCTGTCAGGTGATTCGGTATTTTCTAAAGA 
CAGTCATGGGTTAATTGAGGTGAGTTTATATCAAATCTTCTTTCTTTTTTTTTTTAATTGCTCTGTCrCCCAAGC 
TGGAGTGCAGCGGTACGATC^TAGCTCACTGCGGCCTCAAACTCCTAGGCTOy^GCAATTATCCCACCTTTGCCT 
CCGGTGTAACAGGGACTACAGGTGCAAGCCACCTACTGTCTGCCTATCTATCTATCTATCTATCTATCTATCTAT 
CTATCTATCITATCTATCTATTACTTTCTTGTACAGACGGGAGTCTCACGCCTGTAATCCCAGTACTTTGGGAGGC 
CGAGGCGGGTGGATCACCTGAGGTTGGGAGTTTGAGACGAGCCTG^CCy^CATGGAGAAACCCCGTCTATACTAA 
AAAAATACAAAATTAGCCGGGCGTGGTGGTGCATGTCTGTAATCCCAGCTACTTGGGAGGCTGAGTCAGGAGAAT 
TGCTTTAACCTGGGAGGTGGAGGTTGCAATGAGCTGAGATTGTGCCATTGCACTCCAGCCTGGGCAACAAGAGTG 
AAACTCTATCTCA 



FIGURE 157 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48306 
xsubunit 1 of 1, 916 aa, 1 stop 
><MW: 100204, pi: 4.92, NX(S/T): 4 

MIPARLHRDYKGLVLLGILLGTLWETGCTQIRYSVPEELEKGSRVGDISRDLGLEPRELAER 
GVRI I PRGRTQLFALNPRSGSLVTAGRIDREELCMGAIKCQLNLD ILMEDKVKI YGVEVEVR 
D INDNAPYFRESELE I KI S ENAATEMRFPLPHAWDPD I GKNS LQS YE LS PNTHFS L I VQNGA 
DGSKYPELVLKRALDREEKAAHHLVLTASDGGDPVRTGTARIRVMVLDANDNAPAFAQPEYR 
ASVPENLALGTQLLWNATDPDEGWAEVRYSFRYVDDKAAQVFKLDCNSGTISTIGELDHE 
ESGFYQMEVQAMDNAGYSARAKVL I TVLDVNDNAPE WLTSLASS VPENS PRGTLI ALLNVN 
DQDSEENGQVICFIQGNLPFKLEKSYGNYYSLVTDIVLDREQVPSYNITVTATDRGTPPLST 
ETHISLNVADTNDNPPVFPQASYSAYIPEI^PRGVSLVSVTAHDPDCEENAQITYSIjAENTI 
QGAS LS S YVS INSDTGVLYAIiS S FD YEQFRDLQVKVMARDNGHPPLS SNVS LSLFVLDQNDN 
APEILYPALPTDGSTGVELAPRSAEPGYLVTKWAVDRDSGQNAWLSYRLLKASEPGLFSVG 
LHTGEVRTARALLDRDALKQSLVVAVQDHGQPPLSATVTLTVAVADSIPQVLADLGSLESPA 
NSETSDLTLYLWAVAAVSCVFIAFVILLLALRLRRWHKSRLLQASGGGLTGAPASHFVGVD 
GVQAFLQTYSHEVSLTTDSRKSHLIFPQPNYADMLVSQESFEKSEPLLLSGDSVFSKDSHGL 
IEVSLYQIFFLFFFNCSVSQAGVQRYDHSSLRPQTPRLKQLSHLCLRCNRDYRCKPPTVCLS 
IYLSIYLSIYLSIYLLLSCTDGSLTPVIPVLWEAEAGGSPEVGSLRPA 

Signal sequence: 

amino acids 1-3 0 

Transmembrane domains : 

amino acids 693-711, 809-823, 869-888 



CCCAGGCTCTAGTGCAGGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTAAAAG 
GCTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTCCGAA 
TCAGTAGGTGACCCCGCCGCTGGATTCTGGAAGACCTCACCATGGGACGCCCCCGACCTCGT 
GCGGCCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGC 
ACAGGAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCTTGGCAGGCGG 
CCTTGTTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTT 
ACAGCTGCCCACTGTAAAAAACCGAAATACACAGTACGCCTGGGAGACCACAGCCTACAGAA 
TAAAGATGGCCCAGAGCAAGAAATACCTGTGGTTCAGTCCATCCCACACCCCTGCTACAACA 
GCAGCGATGTGGAGGACCACAACCATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCC 
CTGGGGTCCAAAGTGAAGCCCATCAGCCTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTG 
CACCGTCTCAGGCTGGGGCACTGTCACCAGTCCCCGAGAGAATTTTCCTGACACTCTCAACT 
GTGCAGAAGTAAAAATCTTTCCCCAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACA 
GATGGCATGGTCTGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAGG 
CCCCCTGGTGTGTGATGGTGCACTCCAGGGCATCACATCCTGGGGCTCAGACCCCTGTGGGA 
GGTCCGACAAACCTGGCGTCTATACCAACATCTGCCGCTACCTGGACTGGATCAAGAAGATC 
ATAGGCAGCAAGGGCTGATTCTAGGATAAGCACTAGATCTCCCTTAATAAACTCACAACTCT 
CTGGTTC 



FIGURE 159 

</ usr/ seqdb2 / ss t/DNA/Dnaseqs . min/ss . DNA4 8336 
<subunit 1 of 1, 260 aa, 1 stop 
<MW: 28048, pi: 7.87, NX(S/T): 1 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAAIjFQGQQLLCGGVL 
VGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPWQSIPHPCYNSSDVEDHNHDLMIxL 
QLRDQASLGSKVKPISLADHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFPQICKCED 
AYPGQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRY 
LDWIKKIIGSKG 

Important Features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 51-71 

N-glycosylation site. 

amino acids 110-113 

Serine proteases, trypsin family, histidine active site. 

amino acids 69-74 and 207-217 

Tyrosine kinase phosphorylation site. 

amino acids 182-188 

Kringle domain proteins motif 

amino acids 2 05-217 



FIGURE 160 

GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGC 
CGCGCCGCCCACGCCCCAACCCCGGtCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGCCCGC 
GCCCGCGCCCAGGTGAGCGCTCCGC!CCGCCGCGAGGCCCCGCCCCGGCCCGCCCCCGCCCCG 
CCCCGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAAAC 
ATTCATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCG 
CCCTGTGCGCCCTGCGCGCCCTGCGCACCCGCGGCCCGAGCCCAGCCAGAGCCGGGCGGAGC 
GGAGCGCGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGA 
TGCGGACCCGGCCGCGGGGAGACGGGCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGC 
CCCGCCCAACCCCTACGATGAAGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTG 
CTGTGGCTGCAGGCCTGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATGCTACAATGA 
GCCCAAGGTGACGACAAGCTGCCCCCAGCAGGGCCTGCAGGCTGTGCCCGTGGGCATCCCTG 
CTGCCAGCCAGCGCATCTTCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCCAGCTTC 
CGTGCCTGCCGCAACCTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGC 
GGCTGCCTTCACTGGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCC 
GGTCTGTGGACCCTGCCACATTCCACGGCCTGGGCCGCCTACACACGCTGCACCTGGACCGC 
TGCGGCCTGCAGGAGCTGGGCCCGGGGCTGTTCCGCGGCCTGGCTGCCCTGCAGTACCTCTA 
CCTGCAGGACAACGCGCTGCAGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCA 
CACACCTCTTCCTGCACGGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCTTCCGTGGGCTG 
CACAGCCTCGACCGTCTCCTACTGCACCAGAACCGCGTGGCCCATGTGCACCCGCATGCCTT 
CCGTGACCTTGGCCGCCTCATGACACTCTATCTGTTTGCCAACAATCTATCAGCGCTGCCCA 
CTGAGGCCCTGGCCCCCCTGCGTGCCCTGCAGTACCTGAGGCTCAACGACAACCCCTGGGTG 
TGTGACTGCCGGGCACGCCCACTCTGGGCCTGGCTGCAGAAGTTCCGCGGCTCCTCCTCCGA 
GGTGCCCTGCAGCCTCCCGCAACGCCTGGCTGGCCGTGACCTCAAACGCCTAGCTGCCAATG 
ACCTGCAGGGCTGCGCTGTGGCCACCGGCCCTTACCATCCCATCTGGACCGGCAGGGCCACC 
GATGAGGAGCCGCTGGGGCTTCCCAAGTGCTGCCAGCCAGATGCCGCTGACAAGGCCTCAGT 
ACTGGAGCCTGGAAGACCAGCTTCGGCAGGCAATGCGCTGAAGGGACGCGTGCCGCCCGGTG 
ACAGCCCGCCGGGCAACGGCTCTGGCCCACGGCACATCAATGACTCACCCTTTGGGACTCTG 
CCTGGCTCTGCTGAGCCCCCGCTCACTGCAGTGCGGCCCGAGGGCTCCGAGCCACCAGGGTT 
CCCCACCTCGGGCCCTCGCCGGAGGCCAGGCTGTTCACGCAAGAACCGCACCCGCAGCCACT 
GCCGTCTGGGCCAGGCAGGCAGCGGC3GGTGGCGGGACTGGTGACTCAGAAGGCTCAGGTGCC 
CTACCCAGCCTCACCTGCAGCCTCACCCCCCTGGGCCTGGCGCTGGTGCTGTGGACAGTGCT 
TGGGCCCTGCTGACCCCCAGCGGACACAAGAGCGTGCTCAGCAGCCAGGTGTGTGTACATAC 
GGGGTCTCTCTCCACGCCGCCAAGCCAGCCGGGCGGCCGACCCGTGGGGCAGGCCAGGCCAG 
GTCCTCCCTGATGGACGCCTGCCGCCCGCCACCCCCATCTCCACCCCATCATGTTTACAGGG 
TTCGGCGGCAGCGTTTGTTCCAGAACGCCGCCTCCCACCCAGATCGCGGTATATAGAGATAT 
GCATTTTATTTTACTTGTGTAAAAATATCGGACGACGTGGAATAAAGAGCTCTTTTCTTAAA 
AAAA 



FIGURE 161 



></usr/seq<db2/sst/DNA/Dnaseqs.min/ss.DNA44184 
xsubunit 1 of 1, 473 aa, 1 stop 
><MW: 50708, pi: 9.28, NX(S/T): 6 

MKRASAGGSRLLAWVLWLQAWQVAAPCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTILWLHSNVIARIDAAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRISSVPERAFRGLHSLDRLLLHQNRVAHVHPHAFRDLGRLMTLYLFANNLSALPTEALAP 
LRALQYLRLNDNPWVCDCRARPLWAWLQKFRGSSSEVPCSLPQRLAGRDLKRLAANDLQGCA 
VATGPYHPIWTGRATDEEPLGLPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAWPEGSEPPGFPTSGPRRRPGCSRKNRTRSHCRLGQA 
GSGGGGTGDSEGSGALPSLTCSLTPLGLALVLWTVLGPC 

Important features: 
Signal peptide: 

amino acids 1-26 

Leucine zipper pattern. 

amino acids 135-156 

Glycosaminoglycan attachment site. 

amino acids 436-439 

N-glycosylation site. 

amino acids 82-85, 179-183, 237-240, 372-375 and 423-426 

VWFC domain 

amino acids 411-425 



GGAAGTCCACGGGGAGCTTGGATGCCAAAGGGAGGACGGCTGGGTCCTCTGGAGAGGACTAC 
TCACTGGCATATTTCTGAGGTATCTGTAGAATAACCACAGCCTCAGATACTGGGGACTTTAC 
AGTCCCACAGAACCGTCCTCCCAGGAAGCTGAATCCAGCAAGAACAATGGAGGCCAGCGGGA 
AGCTCATTTGCAGACAAAGGCAAGTCCTTTTTTCCTTTCTCCTTTTGGGCTTATCTCTGGCG 
GGCGCGGCGGAACCTAGAAGCTATTCTGTGGTGGAGGAAACTGAGGGCAGCTCCTTTGTCAC 
CAATTTAGCAAAGGACCTGGGTCTGGAGCAGAGGGAATTCTCCAGGCGGGGGGTTAGGGTTG 
TTTCCAGAGGGAACAAACTACATTTGCAGCTCAATCAGGAGACCGCGGATTTGTTGCTAAAT 
GAGAAATTGGACCGTGAGGATCTGTGCGGTCACACAGAGC C CTGTGTGCTACGTTTC CAAGT 
GTTGCTAGAGAGTCCCTTCGAGTTTTTTCAAGCTGAGCTGCAAGTAATAGACATAAACGACC 
ACTCTCCAGTATTTCTGGACAAACAAATGTTGGTGAAAGTATCAGAGAGCAGTCCTCCTGGG 
ACTACGTTTCCTCTGAAGAATGCCGAAGACTTAGATGTAGGCCAAAACAATATTGAGAACTA 
TATAATCAGCCCCAACTCCTATTTTCGGGTCCTCACCCGCAAACGCAGTGATGGCAGGAAAT 
ACCCAGAGCTGGTGCTGGACAAAGCGCTGGACCGAGAGGAAGAAGCTGAGCTCAGGTTAACA 
CTCACAGCACTGGATGGTGGCTCTCCGCCCAGATCTGGCACTGCTCAGGTCTACATCGAAGT 
CCTGGATGTCAACGATAATGCCCCTGAATTTGAGCAGCCTTTCTATAGAGTGCAGATCTCTG 
AGGACAGTCCGGTAGGCTTCCTGGTTGTGAAGGTCTCTGCCACGGATGTAGACACAGGAGTC 
AACGGAGAGATTTCCTATTCACTTTTCCAAGCTTCAGAAGAGATTGGCAAAACCTTTAAGAT 
CAATCCCTTGACAGGAGAAATTGAACTAAAAAAACAACTCGATTTCGAAAAACTTCAGTCCT 
ATGAAGTCAATATTGAGGCAAGAGATGCTGGAACCTTTTCTGGAAAATGCACCGTTCTGATT 
CAAGTGATAGATGTGAACGAC CATGCCC CAGAAGTTACCATGTCTGCATTTAC CAGC CCAAT 
ACCTGAGAACGCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAGGAG 
AAAATGGGAAAATTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAA 
AACTTTTACACCCTACTAACGGAGAGACCACTAGACAGAGAAAGCAGAGCGGAATACAACAT 
CACTATCACTGTCACTGACTTGGGGACCCCTATGCTGATAACACAGCTCAATATGACCGTGC 
TGATCGCCGATGTCAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCGTC 
CGCGAGAACAACAGCCCCGCCCTGCACATCCGCAGCGTCAGCGCTACAGACAGAGACTCAGG 
CACCAACGCCCAGGTCACCTACTCGCTGCTGCCGCCCCAGGACCCGCACCTGCCCCTCACAT 
CCCTGGTCTCCATCAACGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGACTACGAG 
GCCCTGCAGGGGTTCCAGTTCCGCGTGGGCGCTTCAGACCACGGCTCCCCGGCGCTGAGCAG 
CGAGGCGCTGGTGCGCGTGGTGGTGCTGGACGCCAACGACAACTCGCCCTTCGTGCTGTACC 
CGCTGCAGAACGGCTCCGCGCCCTGCACCGAGCTGGTGCCCCGGGCGGCCGAGCCGGGCTAC 
CTGGTGACCAAGGTGGTGGCGGTGGACGGCGACTCGGGCCAGAACGCCTGGCTGTCGTACCA 
GCTGCTCAAGGCCACGGAGCTCGGTCTGTTCGGCGTGTGGGCGCACAATGGCGAGGTGCGCA 
CCGCCAGGCTGCTGAGCGAGCGCGACGCGGCCAAGCACAGGCTGGTGGTGCTGGTCAAGGAC 
AATGGCGAGCCTCCGCGCTCGGCCACCGCCACGCTGCACGTGCTCCTGGTGGACGGCTTCTC 
CCAGCCCTACCTGCCTCTCCCGGAGGCGGCCCCGACCCAGGCCCAGGCCGACTTGCTCACCG 
TCTACCTGGTGGTGGCGTTGGCCTCGGTGTCTTCGCTCTTCCTCTTTTCGGTGCTCCTGTTC 
GTGGCGGTGCGGCTGTGTAGGAGGAGCAGGGCGGCCTCGGTGGGTCGCTGCTTGGTGCCCGA 
GGGCCCCCTTCCAGGGCATCTTGTGGACATGAGCGGCACCAGGACCCTATCCCAGAGCTACC 
AGTATGAGGTGTGTCTGGCAGGAGGCTCAGGGACCAATGAGTTCAAGTTCCTGAAGCCGATT 
ATCCCCAACTTCCCTCCCCAGTGCCCTGGGAAAGAAATACAAGGAAATTCTACCTTCCCCAA 
TAACTTTGGGTTCAATATTCAGTGACCATAGTTGACTTTTACATTCCATAGGTATTTTATTT 
TGTGGCATTTCCATGCCAATGTTTATTTCCCCCAATTTGTGTGTATGTAATATTGTACGGAT 
TTACTCTTGATTTTTCTCATGTTCTTTCTCCCTTTGTTTTAAAGTGAACATTTACCTTTATT 
CCTGGTTCTT 



FIGURE 163 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48314 
<subunit 1 of 1, 798 aa, 1 stop 
<MW: 87552, pi: 4.84, NX(S/T): 5 

MEASGKLICRQRQVLFSFLLLGLSI1AGAAEPRSYSVVEETEGSSFVTNI1AKDLGLEQREFSR 
RGWWSRGNKLHLQLNQETADLLLNEKLDREDLCGHTEPCVLRFQVLLESPFEFFQAELQV 
IDINDHSPVFLDKQMLVKVSESSPPGTTFPLKNAEDLDVGQNNIENYIISPNSYFRVLTRKR 
SDGRKYPELVLDKALDREEEAELRIiTLTALDGGSPPRSGTAQVYIEVLDVNDNAPEFEQPFY 
RVQI SEDSPVGFLVVKVSATDVDTGVNGE I S YSLFQAS EE IGKTFKINPLTGE I ELKKQLDF 
EKLQSYEWIEARDAGTFSGKCTVLIQVIDVNDHAPEVTMSAFTSPIPENAPETWALFSVS 
DLDSGENGKI SCSI QEDLPFLLKS AENFYTLLTERPLDRESRAE YN I T ITVTDLGTPML ITQ 
LmTVLIADVNDNAPAFTQTSYTLFVRENNSPALHIRSVSATDRDSGTNAQVTYSLLPPQDP 
HLPLTSLVSINADNGHLFALRSLDYEALQGFQFRVGASDHGSPALSSEALVRWVLDANDNS 
PFVLYPLQNGSAPCTELVPRAAEPGYLVTKVVAVDGDSGQNAWLSYQLLKATELGLFGVWAH 
NGEVRTARLLSERDAAKHRLWLVKDNGEPPRSATATLHVLLVDGFSQPYLPLPEAAPTQAQ 
ADLLTVYLWALASVSSLFLFSVLLWAVRLCRRSRAASVGRCLVPEGPLPGHLVDMSGTRT 
LSQSYQYEVCLAGGSGTNEFKFLKPIIPNFPPQCPGKEIQGNSTFPNNFGFNIQ 

Important features : 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 685-712 

Cadherins extracellular repeated domain signature. 

amino acids 122-132, 231-241, 336-346, 439-449 and 549-559 

ATP/GTP -binding site motif A (P-loop) . 

amino acids 285-292 

N-glycosylation site. 

amino acids 418-421, 436-439, 567-570 and 786-789 



FIGURE 164 



ACCCACGCGTCCGCCCACGCGTCCG(CCCACGCGTCCGCCCACGCGTCCGCGCGTAGCCGTGC 
GCCGATTGCCTCTCGGCCTGGGCAATGGTCCCGGCTGCCGGTCGACGACCGCCCCGCGTCAT 
GCGGCTCCTCGGCTGGTGGCAAGTATTGCTGTGGGTGCTGGGACTTCCCGTCCGCGGCGTGG 
AGGTTGCAGAGGAAAGTGGTCGCTTATGGTCAGAGGAGCAGCCTGCTCACCCTCTCCAGGTG 
GGGGCTGTGTACCTGGGTGAGGAGGAGCTCCTGCATGACCCGATGGGCCAGGACAGGGCAGC 
AGAAGAGGCCAATGCGGTGCTGGGGCTGGACACCCAAGGCGATCACATGGTGATGCTGTCTG 
TGATTCCTGGGGAAGCTGAGGACAAAGTGAGTTCAGAGCCTAGCGGCGTCACCTGTGGTGCT 
GGAGGAGCGGAGGACTCAAGGTGCAACGTCCGAGAGAGCCTTTTCTCTCTGGATGGCGCTGG 
AGCACACTTCCCTGACAGAGAAGAGGAGTATTACACAGAGCCAGAAGTGGCGGAATCTGACG 
CAGCCCCGACAGAGGACTCCAATAACACTGAAAGTCTGAAATCCCCAAAGGTGAACTGTGAG 
GAGAGAAACATTACAGGATTAGAAAATTTCACTCTGAAAATTTTAAATATGTCACAGGACCT 
TATGGATTTTCTGAACCCAAACGGTAGTGACTGTACTCTAGTCCTGTTTTACACCCCGTGGT 
GCCGCTTTTCTGCCAGTTTGGCCCCTCACTTTAACTCTCTGCCCCGGGCATTTCCAGCTCTT 
CACTTTTTGGCACTGGATGCATCTCAGCACAGCAGCCTTTCTACCAGGTTTGGCACCGTAGC 
TGTTCCTAATATTTTATTATTTCAAGGAGCTAAACCAATGGCCAGATTTAATCATACAGATC 
GAACACTGGAAACACTGAAAATCTTCATTTTTAATCAGACAGGTATAGAAGCCAAGAAGAAT 
GTGGTGGTAACTCAAGCCGACCAAATAGGCCCTCTTCCCAGCACTTTGATAAAAAGTGTGGA 
CTGGTTGCTTGTATTTTCCTTATTCTTTTTAATTAGTTTTATTATGTATGCTACCATTCGAA 
CTGAGAGTATTCGGTGGCTAATTCCAGGACAAGAGCAGGAACATGTGGA GTAGT GATGGTCT 
GAAAGAAGTTGGAAAGAGGAACTTCAATCCTTCGTTTCAGAAATTAGTGCTACAGTTTCATA 
CATTTTCTCCAGTGACGTGTTGACTTGAAACTTCAGGCAGATTAAAAGAATCATTTGTTGAA 
CAACTGAATGTATAAAAAAATTATAAACTGGTGTTTTAACTAGTATTGCAATAAGCAAATGC 
AAAAATATTCAATAG 



FIGURE 165 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48333 
xsubunit 1 of 1, 360 aa, 1 stop 
><MW: 39885, pi: 4.79, NX(S/T): 7 

MVPAAGRRPPRVMRLLGWWQ VL LWVLGL PVRGVEVAEE SGRLWS EEQPAHPLQVGAVYLGEE 
ELLHDPMGQDRAAEEANAVLGLDTQGDHMVMLSVIPGEAEDKVSSEPSGVTCGAGGAEDSRC 
NVRESLFSLDGAGAHFPDREEEYYTEPEVAESDAAPTEDSNNTESLKSPKVNCEERNITGLE 
NFTLKILNMSQDLMDFLNPNGSDCTLVLFYTPWCRFSASLAPHFNSLPRAFPALHFLALDAS 
QHSSLSTRFGTVAVPNILLFQGAKPIIARFNHTDRTLETLKIFIFNQTGIEAKKNVVVTQADQ 
IGPLPSTLIKSVDWLLVFSLFFLISFIMYATIRTESIRWLIPGQEQEHVE 

Important features: 
Signal peptide: 

amino acids 1-25 

Transmembrane domain: 

amino acids 321-340 

Homologous region to dilstifide isomerase 

amino acids 212-302 

N-glycosylation site. 

amino acids 165-168, 181-184, 187-190, 194-197, 206-209, 278-281 
and 293-296 

Thioredoxin domain 

amino acids 211-227 



FIGURE 166 



CCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCACGATGCTGCAGGGCCCTGGCT 
CGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTTCCTC 
TTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATCCCGGTCAACCTGCA 
GCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACGAGACCA 
TGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCG 
GACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGAC 
CATCCAGCCATGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCG 
CCTTCGGCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTT 
TGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATG 
TGAAGCCTGCAAAAATAAAAATGATGATGACAACGACATAATGGAAACGCTTTGTAAAAATG 
ATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCAACCGAGATACCAAAATCATC 
CTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAGGGACCTGAAGAA 
ATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCACCTGTGAGGAGATGAACGACATCAACG 
CGCCCTATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGG 
TGGCAGAAGGGGCAGAGAGAGTTCAAGCGCATCTCCCGCAGCATCCGCAAGCTGCAGTGCTA 
GTCCCGGCATCCTGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCC 
GGGATCTCAGCTCCCGTTCCCCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCT 
TCCCCCTGCCTTTTGCACGTTTGCATCCCCAGCATTTCCTGAGTTATAAGGCCACAGGAGTG 
GATAGCTGTTTTCACCTAAAGGAAASkGCCCACCCGAATCTTGTAGAAATATTCAAACTAATA 
AAATCATGAATATTTTAA 



wm 



FIGURE 167 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA50920 
xsubunit 1 of 1, 295 aa, 1 stop 
><MW: 33518, pi: 7-74, NX(S/T): 0 

MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPIPVNLQLCHGIEYQNMRLPN 
LLGHETMKEVLEQAGAWI PLVMKQCHPDTKKFLCSLFAPVCLDDLDET IQPCHSLCVQVKDR 
CAPVMSAFGFPWPDMLECDRFPQDNIJLCIPIASSDHLLPATEEAPKV-CEACKNKNDDDNDIM 
ETLCKNDFALKIKVKEITYINRDTKIILETKSKTIYKLNGVSERDLKKSVLWLKDSLQCTCE 
EMNDINAPYLVMGQKQGGELVITSVKRWQKGQREFKRISRSIRKLQC 

Important features: 
Signal peptide: 

amino acids 1-20 

Cysteine rich domain, homolgous to frizzled N terminus 

amino acids 6-153 



FIGURE 168 



GTGGAGGCCGCCGACGATCGCGGGGCCGACGGAGGCCGAGACGGGGTTGGCCGAGCCCCGGG 
CCCTGTGCGCGCAGCGGGGCCACCGCACCTACGCGCGCCGCTGGGTGTTCCTGCTCGCGATC 
AGCCTGCTCAACTGCTCCAACGCCACGCTGTGGCTCAGCTTTGCACCTGTGGCTGACGTCAT 
TGCTGAGGACTTGGTCCTGTCCATGGAGCAGATCAACTGGCTGTCACTGGTCTACCTCGTGG 
TATCCACCCCATTTGGCGTGGCGGCCATCTGGATCCTGGACTCCGTCGGGCTCCGTGCGGCG 
ACCATCCTGGGTGCGTGGCTGAACTTTGCCGGGAGTGTGCTACGCATGGTGCCCTGCATGGT 
TGTTGGGACCCAAAACCCATTTGCCTTCCTCATGGGTGGCCAGAGCCTCTGTGCCCTTGCCC 
AGAGCCTGGTCATCTTCTCTCCAGCCAAGCTGGCTGCCTTGTGGTTCCCAGAGCACCAGCGA 
GCCACGGCCAACATGCTCGCCACCATGTCGAACCCTCTGGGCGTCCTTGTGGCCAATGTGCT 
GTCCCCTGTGCTGGTCAAGAAGGGTGAGGACATTCCGTTAATGCTCGGTGTCTATACCATCC 
CTGCTGGCGTCGTCTGCCTGCTGTCCACCATCTGCCTGTGGGAGAGTGTGCCCCCCACCCCG 
CCCTCTGCCGGGGCTGCCAGCTCCACCTCAGAGAAGTTCCTGGATGGGCTCAAGCTGCAGCT 
CATGTGGAACAAGGCCTATGTCATCCTGGCTGTGTGCTTGGGGGGAATGATCGGGATCTCTG 
CCAGCTTCTCAGCCCTCCTGGAGCAGATCCTCTGTGCAAGCGGCCACTCCAGTGGGTTTTCC 
GGCCTCTGTGGCGCTCTCTTCATCACGTTTGGGATCCTGGGGGCACTGGCTCTCGGCCCCTA 
TGTGGACCGGACCAAGCACTTCACTGAGGCCACCAAGATTGGCCTGTGCCTGTTCTCTCTGG 
CCTGCGTGCCCTTTGCCCTGGTGTCCCAGCTGCAGGGACAGACCCTTGCCCTGGCTGCCACC 
TGCTCGCTGCTCGGGCTGTTTGGCTTCTCGGTGGGCCCCGTGGCCATGGAGTTGGCGGTCGA 
GTGTTCCTTCCCCGTGGGGGAGGGGGCTGCCACAGGCATGATCTTTGTGCTGGGGCAGGCCG 
AGGGAATACTCATCATGCTGGCAATGACGGCACTGACTGTGCGACGCTCGGAGCCGTCCTTG 
TCCACCTGCCAGCAGGGGGAGGATCGACTTGACTGGACAGTGTCTCTGCTGCTGATGGCCGG 
CCTGTGCACCTTCTTCAGCTGCATCCTGGCGGTCTTCTTCCACACCCCATACCGGCGCCTGC 
AGGCCGAGTCTGGGGAGCCCCCCTCCACCCGTAACGCCGTGGGCGGCGCAGACTCAGGGCCG 
GGTGTGGACCGAGGGGGAGCAGGAAGGGCTGGGGTCCTGGGGCCCAGCACGGCGACTCCGGA 
GTGCACGGCGAGGGGGGCCTCGCTAGAGGACCCCAGAGGGCCCGGGAGCCCCCACCCAGCCT 
GCCACCGAGCGACTCCCCGTGCGCAAGGCCCAGCAGCCACCGACGCGCCCTCCCGCCCCGGC 
AGACTCGCAGGCAGGGTCCAAGCGTC-CAGGTTTATTGACCCGGCTGGGTCTCACTCCTCCTT 
CTCCTCCCCGTGGGTGATCACGTAGCTGAGCGCCTTGTAGTCCAGGTTGCCCGCCACATCGA 
TGGAGGCGAACTGGAACATCTGGTCCACCTGCGGGCGGGGGCGAAAGGGCTCCTTGCGGGCT 
CCGGGAGCGAATTACAAGCGCGCACCTGAAAA 



FIGURE 169 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA50988 
xsubunit 1 of 1, 560 aa, 1 stop 
><MW: 58427, pi: 6.86, NX(S/T): 2 

MAGPTEAETGLAEPRALCAQRGHRTYARRWVFLLAISLLNCSNATLWLSFAPVADVIAEDLV 
LSMEQ INWLS LVYL WSTPFGVAAI WI LDS VGLRAAT I LGAWLNFAGS VLRMVPCMWGTQN 
PFAFLMGGQSLCALAQS LVI FS PAKLAALWFPEHQRATANMLATMSNPLGVIjVANVLS PVLV 

kkgedi plmlgvyti pagwcllst i clwes vpptppsagaas sts ekfldglklqlmwnka 
yvilavclggmigi sas fsalleq i lcasghssgfsglcgalf i tfg ilgalalgpyvdrtk 
hfteatkiglclfslacvpfalvsqlqgqtlalaatcsllglfgfsvgpvamelavecsfpv 
gegaatgmifvlgqaegilimlamtaltvrrsepslstcqqgedpldwtvslllmaglctff 
scilavffhtpyrrlqaesgeppstrnavggadsgpgvdrggagragvlgpstatpectarg 
asledprgpgsphpachratpraqgpaatdapsrpgrlagrvqasrfidpagshssfsspwvit 

Important features: 
Signal peptide: 

amino acids 1-44 

Transmembrane domains: 

amino acids 61-79, 98-112, 126-146, 169-182, 201-215, 248-268, 
280-300, 318-337, 341-357., 375-387, 420-441 

N-glycosylation site. 

amino acids 40-43 and 43-46 

Glycosaminoglycan attachment site. 

amino acids 468-471 



FIGURE 170 

GTCCC^C^TCCTGCTCAACTGGGTCAGGTCCCTOTTAG^ 

TAGTTCCCCAGTAGGGGGTCTCCCCTGGCAATTCTTGATCGGCGTTTGGACaTCTCAGATCGCTTCCaATGAAGA 
TGGCCTTGCCTTGGGGTCCTGCTTGTTTCATAATCATC^ 

AAGGAGCACGGGGCTGATCAAGCCATCCAGGAAACACTGGAGGACTTGTCCAGCCTTGAAAGAACTCTAGTGGTT 
TCTGAATCTAGCCCACTTGGCGGTAAGCMGATGCAACTTCTGCAACTTCTGCTGGGGCTTTTGGGGCCAGGTGG 
CTACTTATTTCTTTTAGGGGATTGTCAGGAGG^ 

ATCTGGTACAGTGATCGGGAAGCTGTCCCAGGAACTGGGCCGGGAGGAGAGGCGGAGGCAAGCTGGGGCCGCCTT 
CCAGGTGTTGCAGCTGCCTCAGGCGCTCCCCATTCAGGTGGACTCTGAGGAAGGCTTGCTCAGCACAGGCAGGCG 
GCTGGATCGAGAGCAGCTGTGCCGACAGTGGGATCCCTGCCTGGTTTCCTTTGATGTGCTTGCCACAGGGGATTT 
GGCTCTGATCCATGTGGAGATCCAAGTGCT'3GACATCAATGACCAC(^GCCACGGTTTCCCAAAGGCGAGCAGGA 
GCTGGAAATCTCTGAGAGCGCCTCTCTGCGAACCCGGATCCCCCTGGACAGAGCTCTTGACCCAGACACAGGCCC 
TAACACCCTGCACACCTACACTCTGTCTCCCAGTGAGCACTTTGCCTTGGATGTCATTGTGGGCCCTGATGAGAC 
CAAACATGCAGAACTCATAGTGGTGAAGGAGCTGGACAGGGAAATCCATTCATTTTTTGATCTGGTGTTAACTGC 
CTATGACAATGGGAACCCCCCCAAGTCAGGTACCAGCTTGGTCAAGGTCAACGTCTTGGACTCCAATGACAATAG 
CCCTGCGTTTGCTGAGAGTTCACTGGCACTGGAAATCC^ 

GACCGCCACAGACCCTGACCAAGGCCCCAATGGGGAGGTGGAGTTCTTCCTCAGTAAGCACATGCCTCCAGAGGT 
GCTGGACACCTTCAGTATTGATGCCAAGAGAGGCCAGGTCATTCTGCGTCGACCTCTAGACTATGAAAAGAACCC 
TGCCTACGAGGTGGATGTTCAGGCAAGGGACCTGGGTCCCAATC 
GGTTCTGGATGTCAATGACAACATCCCAAGCATCCACGT 

TCTTCCCAAGGAC^GTTTTATTGCTCTTGTCATGGCAGATGACTTGGATTC^GGACACAATGGTTTGGTCCACTG 

CTGGCTGAGCCAAGAGCTGGGCCACTTCAGGCTGAAAAGAACTAATGGCAACACATACATGTTGCTAACCAATGC 

CACACTGGA(^GAGAGCaGTGGCCa^TATACCCTCACTCTGTTAGCCCAAGACCAAGGA 

AGCCAAGAAACAGCTCAGCATTCAGATCAGTGACATCAACGACAATGCACCTGTGTTTGAGAAAAGCAGGTATG^ 

AGTCTCCACGCGGGAAAACaACTTACCCTCTCTTCACCTCATTACCATCAAGGCTCATGATGCAGACTTGGGCAT 

TAATGGAAAAGTCTC&TACCGCIATCCAGGACTCCCC&GTTGCTC^ 

GGTCACTGCTCAGAGGTCACTGAACTATGAi\GAGATGGCCGGCTTTGAGTTCCAGGTGATCGCAGAGGACAGCGG 
GCAACCCATGCTTGCATCCAGTGTCTCTGTGTGGGTCAGCCTCTTGGATGCCAATGATAATGCCCCAGAGGTGGT 
CCAGCCTGTGCTC^GCGATGGAAAAGCCAGCCTCTCCGTGCTTGTGAATGCCTCCACAGGCCACCTGCTGGTGCC 
CATCGAGACTCCCAATGGCTTGGGCCC^GCGGGCACT 

CCTTTTGACAACCATTGTGGCAAGAGATGGAuGACTCGGGGGCAAATGGAGAGCCCCTCTACAGCATCCGCAATGG 
AAATGAAGCCCACCTCTTCATCCTCAACCCTGAT 

CATTGGGAGTGAGTGGGAGCTGGAGATAGTAGTAGAGGACCAGGGAAGCCCCCCCTTACAGACCCGAGCCCTGTT 

GAGGGTCATGTTTGTCACCAGTGTGGACCACCTGAGGGACTCAGCCCGCAAGCCTGGGGCCTTGAGCATGTCGAT 

GCTGACGGTGATCTGCCTGGCTGTACTGTTGGGCATCTTCGGGTTGATCCTGGCTTTGTTCATGTCCATCTGCCG 

GACAGAAAAGAAGGACAACAGGGCCTACAACTGTCGGGAGGCCGAGTCCACCTAC^ 

CC^GAAAC^CATTCAGAAGGCAGACATCCACCTCGTGCCTGTGCTCAGGGGTCAGGC^ 

CGGGCAGTCCCACAAAGATGTGGACAAGGAGGCGATGATGGAAGCAGGCTGGGACCCCTGCCTGCAGGCCCCCTT 

CCACCTCACCCCGACCCTGTACAGGACGCTGCGTAATCAAGGCAACCAGGGAGCACCGGCGGAGAGCCGAGAGGT 

GCTGCS^GACACGGTCy^CCTCCTTTTCAACCATCCCAGGCAGAGGAATGCCTCCCGGGAGAACCTGAACCTTCC 

CGAGCCCCAGCCTGCCACAGGCCAGCCACGTTCCAGGCCTCTGAAGGTTGCAGGCAGCCCCACAGGGAGGCTGGC 

TGGAGACCAGGGCA.GTGAGGAAGCCCCACAGAGGCCACCAGCCTCCTCTGCAACCCTGAGACGGCIAGCGACATCT 

CAATGGCAAAGTGTCCCCTGAGAAAGAATCAGGGCCCCGTCAGATCCTGCGGAGCCTGGTCCGGCTGTCTGTGGC 

TGCCTTCGCCGAGCGGAACCCCGTGGAGGAGCTCACTGTGGATTCTCCTCCTGTTCAGCAAATCTCCCAGCTGCT 

GTCCTTGCTGCATGAGGGCCAATTCCAGCCCIAAACCAAACCACCGAGGAAATAAGTACTTGGCCAAGCCAGGAGG 

CAGCAGGAGTGCAATCCCAGACACAGATGGCCCAAGTGCAAGGGCTGGAGGCCAGACAGACCCAGA^ 

AGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGAAGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCTGGACCC 

CAGCACAGGTCTGGCCCTGGACCGGCTGAGCGCCCCTGACCCGGCCTGGATGGCGAGACTCTCTTTGCCCCTCAC 

<^CQIACTACCGTGAC7^TGTGATCTCCCCGGATGCTG 

CAAGGCAGAGGCACCAGAGCTGAGCCCAACJvGGGACGAGGCTGGCCAGCA 

GCTGGAGATGCTGCTGGAACAGCGCTCGAGCATGCCCGTGGAGGCCGCCTCCGAGGCGCTGCGGCGGCTCTCGGT 

CTGCGGGAGGACCCTCAGTTTAGACTTGGCCIACCAGTGCAGCCTCAGGCATGAAAGTGCAAGGGGACCCAGGTGG 

AAAGACGGGGACTGAGGGCAAGAGCAGAGGCAGCAGCAGCAGCAGCAGGTGCCTGTGAACATACCTCAGACGCCT 

CTGGATCCAAGAACCAGGGGCCTGAGGATCTGTGGACAAGAGCTGGTTTCTAAAATCTTGTAACTCACTAGCTAG 

CGGCGGCCTGAGAACTTTAGGGTGACTGATGCTACCCCCACAGAGGAGGCAAGAGCCCCAGGACTAACAGCTGAC 

TGACCAAAGCAGCCCCTTGTAAGCAGCTCTGAGTCTTTTGGAGGACAGGGACGGTTTGTGGCTGAGATAAGTGTT 

TCCTGGGAAAACATATGTGGAGCACAAAGGGjTCAGTCCTCTGGCAGAACAGATGCCACGGAGTATCACA 

AAAGGGTGGCCTTCTTGGGTAGCAGGAGTCAGGGGGCTGTACCCTGGGGGTGCCAGGAAATGCTCTCTGACCTAT 

C^TAAAGGAAAAGCAGTAAAAAAAAAAAAAAAAAAAA 



FIGURE 171 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48331 
<subunit 1 of 1, 1184 aa, 1 stop 
<MW: 129022, pi: 5.20, NX(S/T): 5 

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQA 
GAAFQVLQLPQALPIQVDSEEGLLSTGRRLDREQLCRQWDPCLVSFDVLATGDLALIHVEIQ 
VLD INDHQPRFPKGEQELE I SESASLRTRI PLDRALDPDTGPNTLHTYTLS PSEHFALDVI V 
GPDETKHAELIVVKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVNVLDSNDNSPAFAESS 
LALEIQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVIiDTFSIDAKTGQVILRRPL 
DYEKNPAYEVDVQARDLGPNP I PAHCKVL I KVLDVNDNI PS IHVTWASQPSLVSEALPKDSF 
IALVKAIDDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQD 
QGLQPLSAKKQLSIQISDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVS 
YRIQDSPVAHLVAIDSNTGEVTAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDA 
NDNAPEWQPVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTT 
IVARDADSGANGEPLYSIRNGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIWEDQGS 
PPLQTRALLRVMFVTSVDHLRDS ARICPGALSMSMLTVI CLAVLLG I FGL I LALFMS I CRTEK 
KDNRAYNCREAESTYRQQPKRPQKHIQKADIHLVPVLRGQAGEPCEVGQSHKDVDKEAMMEA 
GWDPCLQAPFHLTPTLYRTLRN0^3NQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEP 
QPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQ 
ILRSLVRLSVAAFAERNPVEELTVDrSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGS 
RSAIPDTDGPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPD 
PAWMARLSLPLTTNYRDNVISPDAAi^TEEPRTFQTFGKAEAPELSPTGTRLASTFVSEMSSL 
LEMLLEQRSSMPVEAASEALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSS 
SSSRCL 

Important features: 
Signal peptide: 
amino acids 1-13 
Transmembrane domain: 
amino acids 719-739 
N-glycosylation site. 

amino acids 415-418, 582-585, 659-662, 662-665 amd 857-860 

Cadherins extracellular repeated domain signature. 

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563 



FIGURE 172 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGG 
CAGACCGTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAG 
GCAGGAGCCTTCCTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCT 
CCCAGATACTATTTTTTGGATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTAT 
GAGATACGTCAGTATGTTGTACAGGTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCAT 
GTTTGAGCTCATCATCTTTGAAATCTTAGGAGTATTGAATAGCAGCTCCCGTTATTTTCACT 
GGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTTTCATGGTGCCTTTTTACATTGGC 
TATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTTCCTGTCTCTT 
ATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCCCATTCTCAGCCCAA 
AACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAGTGACTCTC 
ATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTCCT 
CAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATA 
TGATCATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAA 
GTGCATAACAAACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGG 
AAGTGAAAATCTTACTCTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGC 
TTTTTCTGGAAACAGCTGATCTATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTC 
AAGGGGAAATATTTTAATTTTCTTGGTTACTTTTTCTCTATTTACTGTGTTTGGAAAATTTT 
CATGGCTACCATCAATATTGTTTTTGATCGAGTTGGGAAAACGGATCCTGTCACAAGAGGCA 
TTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGTGAAGTTTTGGTCCCAACACATT 
TCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGATTGCTGATCACTCTTAC 
CAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTGCTATTAGCAC 
AGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTTAGAA 
TACCGCACCATAATCACTGAAGTCC1?TGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTT 
TGATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAAC 
AGGCACCAGAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGT 
GGTTTCAAAATTTAGATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAAC 
AAACAAAATGCTATGGTAGCATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATA 
CTATGACCATGAGTAGCATCAGCCAGAACATGAGAGGGAGAACTAACTCAAGACAATACTCA 
GCAGAGAGCATCCCGTGTGGATATGAGGCTGGTGTAGAGGCGGAGAGGAGCCAAGAAACTAA 
AGGTGAAAAATACACTGGAACTCTGC3GGCAAGACATGTCTATGGTAGCTGAGCCAAACACGT 
AGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTGCCTTGAGATTGACTCATT 
AAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 



FIGURE 173 



MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEI 
LGVLNSSSRYFHWKmLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFF 
WKLGDPFPI LS PKHGI LS IEQLI SRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTD I 
LALERRLLQTMDMIISKKKRMAMJURRTMFQ 

QEVDALEELSRQLFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVF 
DRVGKTDPVTRGIEITVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITIiTKFFYAISS 
S KS SNVI VLLLAQ IMGMYFVS S VLLI RMSMPLEYRT 1 1 TEVLGELQFNFYHRWFDVT FLVS A 
LSSILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 



FIGURE 174 



CATGGGAAGTGGAGCCGGAGCCTTCCTTACACTCGCCATGAGTTTCCTCATCGACTCCAGCA 
TCATGATTACCTCCCNGANACTATTTTTTGGATTTGGGTGGCTTTTCTTCNGCGCCAATGTT 
TAAAGACTATGAGATACGTCAGTAT(3TTGTACNGGTGATCTTCTCCGTGACGTTTGCCATTT 
CTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTNGGAGTATTGAATAGCAGCTCCCGT 
TATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTNTCATGGTGCCTTT 
TTACATTGGCTATTTTATTGTGAGCi\ATATCCGACTACTGCATAAACAACGACTGCTTTTTT 
CCTGTCTCTTATGGCTGACCTTTATGTATTTCCAG 



FIGURE 175 



GTGTTGCCCTTGGGGAGGGGAAGGG(3AGCCNGGCCCTTTCCTAAAATTTGGCCAAGGGTTTC 
TTTNTTGAATTCCGGGTTN3STGNATACCTTCCCAGAAAATATTTTTTGGATTTGGGGTAGNTT 
TTTTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAGGTGATNTT 
NTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATNTTTGAAATNTTAGGAG 
TATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATC 
CTGGTTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCA 
TAAACAACGACTGCTTTTTTCCTGTCTNTTATGGCTGACCTTTATGTATTTNTTNTGGAAAN 
TAGGAGATCCCTTTCCCATTCTC 



FIGURE 176 



CTCGCGCAGGGATCGTCCCATGGCCGGGGCTCGGAGCCGCGACCCTTGGGGGGCCTCCGGGATTTGCTACCTTTT 
TGGCTCCCTGCTCGTCGAACTGCTCTTCTCACGGGCTGTCGCCTTCAATCTGGACGTGATGGGTGCCTTGCGCAA 
GGAGGGCGAGCCAGGCAGCCTCTTCGGCTTCTCTGTGGCCCTGC^CCGGCAGTTGCAGCCCCGACCCCAGAGCTG 
GCTGCTGGTGGGTGCTCCCCAGGCCCTGGCTCTTCCTGGGCAGCAGGCGAATCGCACTGGAGGCCTCTTCGCTTG 
CCCGTTGAGCCTGGAGGAGACTGACTGCTACAGAGTGGACATCGACCAGGGAGCTGATATGC^?^GGAAAGCAA 
GGAGAACCAGTGGTTGGGAGTCAGTGTTCGGAGCCAGGGGCCTGGGGGCAAGATTGTTACCTGTGCACACCGATA 
TGAGGQ^AGGCAGCGAGTGGACCAGATCCTGGAG^CGCGGGATATGATTGGTCGCTGCTTTGTGCTCAGCCAGGA 
CCTGGCCATCCGGGATGAGTTGGATGGTGGGGAATGGAAGTTCTGTGAGGGACGCCCCCAAGGCCATGAACAATT 
TGGGTTCTGCCAGCAGGGCAC^GCTGCCGCCTTCTCCCCTGATAGCC^CTACCTCCTCTTTGGGGCCCCAGGAAC 
CTATAATTGGAAGGGCACGGCCAGGGTGGAGCTCTGTGCACAGGGCTCAGCGGACCTGGCACACCTGGACGACGG 
TCCCTACGAGGCGGGGGGAGAGAAGGAGCAGGACCCCCGCCTCATCCCGGTCCCTGCCAACAGCTACTTTGGCTT 
CTCTATTGACTCGGGGAAAGGTCTGGTGCGTGCAGAAGAGCTGAGCTTTGTGGCTGGAGCCCCCCGCGCCAACCA 
CAAGGGTGCTGTGGTCaTCCTGCGCAAGGACAGCGCCAGTCGCCTGGTGCCCGAGGTTATGCTGTCTGGGGAGCG 
CCTGACCTCCGGCTTTGGCTACTCACTGGCTGTGGCTGACCTCAACAGTGATGGCTGGCCAGACCTGATAGTGGG 
TGCCCCCTACTTCTTTGAGCGCCAAGAAGAGCTGGGGGGTGCTGTGTATGTGTACTTGAACCAGGGGGGTCACTG 
GGCTGGGATCTCCCCTCTCCGGCTCTGCGGCTCCCCTGACTCCATGTTCGGGATCAGCCTGGCTGTCCTGGGGGA 
CCTCAACCAAGATGGCTTTCCAGATATTGCAGTGGGTGCCCCCTTTGATGGTGATGGGAAAGTCTTCATCTACCA 
TGGGAGCAGCCTGGGGGTTGTCGCCAAACCTTCACAGGTGCTGGAGGGCGAGGCTGTGGGCATCAAGAGCTTCGG 
CTACTCCCTGTCAGGCAGCTTGGATATGGATGGGAACCAATACCCTGACCTGCTGGTGGGCTCCCTGGCTGACAC 
CGC^GTGCTCTTCAGGGCCAG&CCCaTCCT^ 

GGAGCAGCCC^CTGTGCTGGCGGCCACTCGGTCTGTGTGGACCTAAGGGTCTGTTTCAGCTAC^TTGCAGTCCC 

CAGCAGCTATAGCCCTACTGTGGCCCTGGACTATGTGTTAGATGCGGACACAGACCGGAGGCTCCGGGGCCAGGT 

TCCCCGTGTGACGTTCCTGAGCCGTAACCTCK3AAGAACCCAAGCACCAGGCCTCGGGCACCGTGTGGCTGAAGCA 

CCAGCATGACCGAGTCTGTGGAGACGCCATGTTCCAGCTCCAGGAAAATGTCAAAGACAAGCTTCGGGCCATTGT 

AGTGACCTTGTCCTACAGTCTCCAGACCCerCGGCTCCGGCGACAGGCTCCTGGCCAGGGGCTGCCTCCAGTGGC 

CCCCATCCTCAATGCCCA.CCA.GCCCAGCACCCAGCGGGCAGAGATCCACTTCCTGAAGCAAGGC 

CAAGATCTGCCaGAGCAATCTGCAGCTGGTCCACGCCCGCTTCTGTACCCGGGTCAGCGACACGGAATTCCAACC 

TCTGCCCATGGATGTGGATGGAACAACAGCC^CTGTTTGCACTGAGTGGGCAGCCAGTCATTGGCCTGGAGCTGAT 

GGTCACC^^CCTGCCATCGGACCCAGCCCAGCCCCAGGCTGATGGGGATGATGCCCATGAAGCCCAGCTCCTGGT 

CATGCTTCCTGACTCaCTGCACTACTCAGGGGTCCGGGCCCTGGACCCTGCGGAGAAGCCACTCrGCCTGTCCAA 

TGAGAATGCCTCCCATGTTGAGTGTGAGCTGGGGAACCCCATGAAGAGAGGTGCCCAGGTCACCTTCTACCTCAT 

CCTTAGCACCTCCGGGATCAGCATTGAGACCACGGAACTGGAGGTAGAGCTGCTGTTGGCCACGATCAGTGAGCA 

GGAGCTGCATCCAGTCTCTGCACGAGCCCGTGTCTTCATTGAGCTGCCACTGTCCATTGCAGGAATGGCCATTCC 

CCAGCAACTCTTCTTCTCTGGTGTGGTGAGCjGGCGAGAGAGCCATGCAGTCTGAGCGGGATGTGGGCAGCAAGGT 

CAAGTATGAGGTCACGGTTTCCAACCS^GGCCAGTCGCTCAGAACCCTGGGCTCTGCCTTCCTCAACATC^ 

gcctcatgagattgccaatgggaagtggttgctgtacccaatgcaggttgagctggagggcgggcaggggcctgg 

^2^^ GCTTT ^ TCTCC ^^ CC ^ CaTCCTCCACCTGGAT GTGGACAGTAGGGATAGGAGGCGGCGGGA 
GCTGGAGCCACCTGAGCAGCAGGAGCCTGGTGAGCGGCAGGAGCCCAGCATGTCCTGGTGGCCAGTGTCCTCTGC 
TGAGAAGAAGAAAAACATCACCCTGGACTGCGCCCGGGGCACGGCCAACTGTGTGGTGTTCAGCTGCCCACTCTA 
CAGCTTTGACCGCGCGGCTGTGCTGCATGTCTGGGGCCGTCTCTGGAACAGCACCTTTCTGGAGGAGTACTCAGC 
TGTGAAGTCCCTGGAAGTGATTGTCCGGGCCAACATCACAGTGAAGTCCTCCATAAAGAACTTGATGCTCCGAGA 
TGCCTCCACAGTGATCCCAGTGATGGTATACTTGGACCCCATGGCTGTGGTGGCAGAAGGAGTGCCCTGGTGGGT 
CATCCTCCTGGCTGTACTGGCTGGGCTGCTGGTGCTAGCACTGCTGGTGCTGCTCCTGTGGAAGATGGGATTCTT 
CAAACGGGCGAAGCACCCCGAGGCCACCGTGCCCCAGTACCATGCGGTGAAGATTCCTCGGGAAGACCGACAGCA 
GTTCAAGGAGGAGAAGACGGGCACCATCCTGAGGAACAACTGGGGCAGCCCCCGGCGGGAGGGCCCGGATGCACA 
CCCC^TCCTGGCTGCTGACGGGCATCCCGAGCTGGGCCCCGATGGGCATCCAGGGCCAGGCACCGCCTAGGTTCC 
CATGTCCCAGCCTGGCCTGTGGCTGCCCTCCATCCCTTCCCCAGAGATGGCTCCTTGGGATGAAGAGGGTAGAGT 
GGGCTGCTGGTGTCGCATCAAGATTTGGCAGGATCGGCTTCCTCAGGGGCACAGACCTCTCCCACCCACAAGAAC 
TCCTCCCACCCAACTTCCCCTTAGAGTGCTGTGAGATGAGAGTGGGTAAATCAGGGACAGGGCCATGGGGTAGGG 
TGAGAAGGGCAGGGGTGTCCTGATGCAAAGGTGGGGAGAAGGGATCCTAATCCCTTCCTCTCCCATTCACCCTGT 
GTAACAGGACCCCAAGGACCTGCCTCCCCGGAAGTGCCTTAACCTAGAGGGTCGGGGAGGAGGTTGTGTCACTGA 
CTCAGGCTGCTCCTTCTCTAGTTTCCCCTCTCATCTGACCTTAGTTTGCTGCCATCAGTCTAGTGGTTTCGTGGT 
TTCGTCTATTTATTAAAAAATATTTGAGAACAAAAAAAAAAAAAAAAAAAA 



FIGURE 177 

>< /usr / seqdb2 / s s t /DNA/Dnaseqs . min/ ss . DNA5 5 73 7 
xsubunit 1 of 1, 1141 aa, 1 stop 
xMW: 124671, pi: 5.82, NX(S/T): 5 

MAGARSRDPWGASGICYLFGSLLVELLFSRAVAFNLDVMGALRKEGEPGSLFGFSVALHRQL 
QPRPQSWLLVGAPQALALPGQQANRTGGLFACPLSLEETDCYRVDIDQGADMQKESKENQWL 
GVS VRSQGPGGKI VTCAHRYEARQRVDQ I LETRDMI GRC FVLSQDLAI RDE LDGGEWKFCEG 
RPQGHEQFGFCQQGTAAAFSPDSHYLLFGAPGTYNWKGTARVELCAQGSADLAHLDDGPYEA 
GGEKEQDPRL I PVPANSYFGFS IDSGKGLVRAEELSFVAGAPRANHKGAWI LRKDSASRLV 
PEVMLSGERLTSGFGYSLAVADLNSDGWPDLIVGAPYFFERQEELGGAVYVYLNQGGHWAGI 
SPLRLCGSPDSMFGISLAVLGDLNQDGFPDIAVGAPFDGDGKVFIYHGSSLGWAKPSQVLE 
GEAVGIKSFGYSLSGSLDMDGNQYPDLLVGSLADTAVLFRARPILHVSHEVSIAPRSIDLEQ 
PNCAGGHSVCVDLRVCFSYIAVPSSYSPTVALDYVLDADTDRRLRGQVPRVTFLSRNLEEPK 
HQASGTVWLKHQHDRVCGDAMFQLQENVKDKLRAIWTLSYSLQTPRLRRQAPGQGLPPVAP 
I LNAHQPSTQRAE IHFLKQGCGEDKI CQSNLQLVHARFCTRVSDTEFQPLPMDVDGTTALFA 
LSGQPVIGLELMVTNLPSDPAQPQADGDDAHEAQLLVMLPDSLHYSGVRALDPAEKPLCLSN 
ENASHVECELGNPMKRGAQVTFYLILSTSGISIETTELEVELLLATISEQELHPVSARARVF 
IELPLSIAGMAIPQQLFFSGVVRGERAMQSERDVGSKVKYEVTVSNQGQSLRTLGSAFLNIM 
WPHEIANGKWLLYPMQVELEGGQGPGQKGLCSPRPNILHLDVDSRDRRRRELEPPEQQEPGE 
RQEPSMSWWPVSSAEKKKNITLDCARGTANCWFSCPLYSFDRAAVLHVWGRLWNSTFLEEY 
S AVKSLE VI VRANI TVKSS IKNLMLFIDASTVI PVMVYLDPMAWAEGVPWWI LLAVLAGLL 
VLALLVLLLWKMGFFKRAKHPEATVPQYHAVKIPREDRQQFKEEKTGTILRNNWGSPRREGP 
DAHPILAADGHPELGPDGHPGPGTA 



Important features : 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

amino acids 1040-1062 



N-glycosylation sites. 

amino acids 86-89, 746-749, 949-952, 985-988 and 1005-1008 



Integrins alpha chain proteins. 

amino acids 1064-1071, 384-408, 1041-1071, 317-346, 443-465, 385- 
407, 215-224, 634-647, 85-99, 322-346, 470-479, 442-466, 379-408 
and 1031-1047 



FIGURE 178 



CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGA 

AAGCAGCGAGTTGGCAGAGCAGGGCTGCATTTCCAGCAGGAGCTGCGAGCACAGTGCTGGCT 

CACAACAAGATGCTCAAGGTGTCAGCCGTACTGTGTGTGTGTGCAGCCGCTTGGTGCAGTCA 

GTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGGTAATTTTC 

TGGATGATAAACAATGGCTCACCAC!AATCTCTCAGTATGACAAGGAAGTCGGACAGTGGAAC 

AAATTCCGAGACGAAGTAGAGGATGATTATTTCCGCACTTGGAGTCCAGGAAAACCCTTCGA 

TCAGGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTAT 

GCATTGCTCAAGATTCTCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATG 

AAAGAAGCAGGAGTAGACCATAGGCAGTGGAGGGGTCCCATATTATCCACCTGCAAGCAGTG 

CCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCATACCTACTCTTTTCAGTGCA 

AACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTCAGTCAAATGTGAAGGACATTGC 

CCATGTCCTTCAGATAAGCCCACCAGTACAAGCAGAAATGTTAAGAGAGCATGCAGTGACCT 

GGAGTTCAGGGAAGTGGCAAACAGATTGCGGGACTGGTTCAAGGCCCTTCATGAAAGTGGAA 

GTCAAAACAAGAAGACAAAAACATTGCTGAGGCCTGAGAGAAGCAGATTCGATACCAGCATC 

TTGCCAATTTGCAAGGACTCACTTGGCTGGATGTTTAACAGACTTGATACAAACTATGACCT 

GCTATTGGACCAGTCAGAGCTCAGAAGCATTTACCTTGATAAGAATGAACAGTGTACCAAGG 

CATTCTTCAATTCTTGTGACACATACAAGGACAGTTTAATATCTAATAATGAGTGGTGCTAC 

TGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCA 

AGGGGTAAAGAAGCTCCTAGGACAGTATATCCCCCTGTGTGATGAAGATGGTTACTACAAGC 

CAACACAATGTCATGGCAGTGTTGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTC 

ATGGGATCCAGAATAAATGGTGTTGCAGATTGTGCTATAGATTTTGAGATCTCCGGAGATTT 

TGCTAGTGGCGATTTTCATGAATGGACTGATGATGAGGATGATGAAGACGATATTATGAATG 

ATGAAGATGAAATTGAAGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGAC 

CATGATGTATACATTTGATTGATGACAGTTGAAATCAATAAATTCTACATTTCTAATATTTA 

CAAAAATGATAGCCTATTTAAAATmTCTTCTTCCCCAATAACAAAATGATTCTAAACCTCA 

CATATATTTTGTATAATTATTTGAAAAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAAT 

AAGAATCATTTGCTTTGAGTTTTTATATTCCTTACACAAAAAGAAAATACATATGCAGTCTA 

GTCAGACAAAATAAAGTTTTGAAGTGCTACTATAATAAATTTTTCACGAGAACAAACTTTGT 

AAATCTTCCATAAGCAAAATGACAGC-TAGTGCTTGGGATCGTACATGTTAATTTTTTGAAAG 

ATAATTCTAAGTGAAATTTAAAATAJiATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGG 

AAAAATATGCATGCTTTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTCAG 

GATAACAGAGAGATACCACATGACTCCAAAAAAAAAAAAAAA 



FIGURE 179 



>< / u s r / s eqdb2 / s s t /DNA/Dnas eqs . min / s s . DNA4 9 8 2 9 
xsubunit 1 of 1, 436 aa, 1 stop 
><MW: 49429, pi: 4.80, NX(S/T): 0 

MLKVS AVLC VCAAAWC S QS LAAAAAVAAAGGRSDGGNFLDDKQWLTT I SQYDKEVGQWNKFR 
DEVEDDYFRTWS PGKPFDQALDPAKDPCLKMKCSRHKVCI AQDSQTAVCI SHRRLTHRMKEA 
GVDHRQWRGPILSTCKQCPWYPSPVCGSDGHTYSFQCKLEYQACVLGKQISVKCEGHCPCP 
SDKPTSTSRNVKRACSDLEFREVANRLRDWFKALHESGSQNKKTKTLLRPERSRFDTSILPI 
CKDSLGWMFNRLDTNYDLLLDQSELRS I YLDKNEQCTKAFFNSCDTYKDSLI SNNEWCYCFQ 
RQQDPPCQTELSNIQKRQGVKKLLGQYIPLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGS 
RINGVADCAIDFEISGDFASGDFHEWnDDEDDEDDIMNDEDEIEDDDEDEGDDDDGGDDHDVYI 

Important features: 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern. 

amino acids 246-267 

N-myristoylation sites. 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins 

amino acids 353-365 and 339-352 



FIGURE 180 



CAGACTCCAGATTTCCCTGTCAACCACGAGGAGTCCA.GAGAGGAAACGCGGAGCGGAGACAACAGTACCTGACGC 
CTCTTTCAGCCCGGGATCGCCCCkGC^^ 

GCTCTGCCTCCGGTGCTGCTGCCTGGGGCGGCCGGCTTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTT 
CCCGCCGGCCAGAAGGAGTGCTTCTACCAGCCCATGCCCCTGAAGGCCTCGCTGGAGATCGAGTACCAAGTTTTA 
GATGGAGCAGGATTAGATATTGATTTCC&TCTTC^ 

TCAGATGGAGTTCACACTGTAGAGACTGAAGTTGGTGATTACATGTTCTGCTTTGACAATACATTCAGCACCATT 
TCTGAGAAGGTGATTTTCTTTGAATTAATCCTGGATAATATGGGAGAACAGGCACAAGAACAAGAAGATTGGAAG 
AAATATATTACTGGCACAGATATATTGGATATGAAACTGGAAG^ 

AGACTAAGCAAAAGTGGGCACATACAAATTCTGCTTAGAGCATTTGAAGCTCGTGATCGAAACATACAAGAAAGC 
AACTTTGATAGAGTCAAl^TCTGGTCTATGGTTAATTTAGTGGTC^TGGTGGTGGTGTCAGCCATTCAAGTTTAT 
ATGCTGAAGAGTCTGTTTGAAGATAAGAGGAAAAGTAGAAC TTAAA ACTCCAAACTAGAGTACGTAACATTGAAA 
AATGAGGCATAAAAATGCAATAAACTGTTACAGTCmGACC^^ 

AAAGTAGGAAACAGGTATAATTTTAATGTGAAAATTAAGTCTTCACTTTCTGTGCAAGTAATCCTGCTGATCCAG 
TTGTACTTAAGTGTGTAACAGGAATATTTTGCAGAATATAGGTTTAACTGAATGAAGCCATATTAATAACTGCAT 
TTTCCTAACTTTGAAAAATTTTGCAAATGTCT^ 

AGTCTGTTTraAACAGGTTCTATTACCCAGAACTTTTTTGTAAATGCGGCAGTTACAAATTAACTGTGGAAGTTT 
TCAGTTTTAAGTTATAAATCACCTGAGAATTACCT^ 

CTTTTCTCTATTTACATATGCATCTCTCCTATAATGTAAATAGAATAATAGCTTTGAAATACAATTAGGTTTTTG 
AGATTTTTATAACCAAATACATTTCAGTGTAACAT 

CCAAAAGCTGACATTTTCACGATTCTTAAAAACACAAAGTTACACTTACTAAAATTAGGACATGTTTTCTCTTTG 
AAATGAAGAATATAGTTTAAAAGCTTCCTCCTCCATAGGGACACATTTTCTCTAACCCTTAACTAAAGTGTAGGA 
TTTTAAAATTAAATGTGAGGTAAAATAAGTTTATTTTTAATAGTATCTGTCAAGTTAATATCTGTCAACAGTTAA 
TAATCATGTTATGTTAATTTTAACATGATTGCTGACTTGGATAATTCATTATTACCAGCAGTTATGAAGGAAATA 
TTGCTAAAATGATCTGGGCCTACCATAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGAAAACAGGAA 
AGAATTTAGAAAAACTTGAGAAAACCTAATCCAAAATAAAATTCACTTAAGTAGAACTATAAATAAATATCTAGA 
ATCTGACTGGCTCATCATGACATCCTACTCATAACATAAATCAAAGGAGATGATTAATTTCCAGTTAGCTGGAAG 
AAACTTTGGCTGTAGGTTTTTATTTTCTACAAGAATTCTGGTTTGAATTATTTTTGTAAGCAGGTACATTTTATA 
AAATGTAAGCCCTACTGTAAGGTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATAACAACTTTTAT 
TAAAATGGCCTTTCTGAACACTTTATTTATTGATGTTGAAGTAAGGATTAGAAACATAGACTCCCAAGTTTTAAA 
CACCTAAATGTGAATAACCCATATATACAACAAAGTTTC^ 

TC^GTACTAGTAATTTAACTTC^TCaTGAATGAACTATAATTTTTAAGTTATGCCCATTTATAACGTTGTTTAT 
GACTACATTGTGAGTTAGAAACAAACTTAAAATTTGGGGTATAGAACCCCTCAACAGGTTAGTAATGCTGGAATT 
CTTGATGAGCAATAATGATAACCAGAGAGTGATTTCATTT^ 

TCTTAGGCCCCTGGGAGAAGAGCaGCTTAGATTTCCCTACTGGCAAGGTTTTTAAAAATGAGGTAAATGCCGTAT 
ATGATC^^TTACCTTAATTGGCCAAGAAAATGCTTCAGGTGTCTAGGGGTATCCTCTGCAACACTTGCAGAACAA 
AGGTCAATAAGATCCTTGCCTATGAATACCCCTCCCTTTTGCGCTGTTAAATTTGCAATGAGAAGCAAATTTACA 
GTACCATAACTAATAAAGCAGGGTACAGATATAAACTACTGCft.TCTTTTCTATAAAACTGTGATTAAGAATTCTA 
CCTCTCCTGTATGGCTGTTACTGTACTGTACTCTCTGACTCCTTACCTAACAATGAATTTGTTACATAATCTTCT 
ACATGTATGATTTGTGCCACTGATCTTAAACCTATGATTCAGTAACTTCTTACCATATAAAAACGATAATTGCTT 
TATTTGGAAAAGAATTTAGGAATACTAAGGACAATTATTTTTATAGACAAAGTAAAAAGACAGATATTTAAGAGG 
CATAACCaAAAAAGCAAAACTTGTAAACAGAGTAAAAATCTTTAATATTTCTAAAGACATACTGTTTATCTGCTT 
C^TATGCTTTTTTTAATTTCACTAT^^ 

AAC^GCTCATTTTGTCTTTTTCAATATACAAATTTTAAAAATACTACAATATTTAACTAAGGCCCAACCGATTTC 

CATAATGTAGCAGTTACCGTGTTCACCTCACACrAAGGCCTAGAGTTTGCTCTGATATGCATTTGGATGATTAAT 

GTTATGCTGTTCTTTCATGTGAATGTCAAGAGATGGAGGGTGTTTGTAATTTTATGGTAAAATTAATCCTTCTTA 

CACATAATGGTGTCTTAAAATTGACAAAAAATGAGCACTTACAATTGTATGTCTCCTCaAATGA^ 

GTGAAATTTTAAAAGACATTGATTCCGCATGTAAGGATTTTTCATCTGAAGTACAATAATGCACAATCAGTGTTG 

CTCAAACTGCTTTATACTTATAAACAGCCATCTTAAATAAGCAACGTATTGTGAGTACTGATATGTATATAATAA 

AAATTATCAAAGGAAAA 



FIGURE 181 

></usr/seqdb2/sst/DNA/Dnaseq;s .min/ss .DNA52196 
xsubunit 1 of 1, 229 aa, 1 stop 
><MW: 26017, pi: 4.73, NX(S/T): 0 

MGDKIWLPFPVLLLAALPPVLLPGA?kGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEIEY 
QVLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISEKVIFFEL 
ILDNMGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQILLRAFEARDRN 
IQESNFDRVNFWSMVNLVVMVWSAIQVYMLKSLFEDKRKSRT 

Important features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain : 

amino acids 195-217 

N-myristoylation site. 

amino acids 43-48 

Tyrosine kinase phosphorylation site. 

amino acids 55-62 



FIGURE 182 

CCATCCCTGAGATCTTTTTATAAAAAACCCAGTCTTTGCTGACCAGACAAAGCATACCAGAT 
CTCACCAGAGAGTCGCAGACAC TATG CTGCCTCCCATGGCCCTGCCCAGTGTGTCCTGGATG 
CTGCTTTCCTGCCTCATTCTCCTGTGTCAGGTTCAAGGTGAAGAAACCCAGAAGGAACTGCC 
CTCTCCACGGATCAGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCCCTGCTATGCCTTGT 
TTTTGTCACCAAAATCCTGGATGGATGCAGATCTGGCTTGCCAGAAGCGGCCCTCTGGAAAA 
CTGGTGTCTGTGCTCAGTGGGGCTGAGGGATCCTTCGTGTCCTCCCTGGTGAGGAGCATTAG 
TAACAGCTACTCATACATCTGGATTGGGCTCCATGACCCCACACAGGGCTCTGAGCCTGATG 
GAGATGGATGGGAGTGGAGTAGCACTGATGTGATGAATTACTTTGCATGGGAGAAAAATCCC 
TCCACCATCTTAAACCCTGGCCACTGTGGGAGCCTGTCAAGAAGCACAGGATTTCTGAAGTG 
GAAAGATTATAACTGTGATGCAAAGTTACCCTATGTCTGCAAGTTCAAGGACTAGGGCAGGT 
GGGAAGTCAGCAGCCTCAGCTTGGCGTGCAGCTCATCATGGACATGAGACCAGTGTGAAGAC 
TCACCCTGGAAGAGAATATTCTCCCCAAACTGCCCTACCTGACTACCTTGTCATGATCCTCC 
TTCTTTTTCCTTTTTCTTCACCTTCATTTCAGGCTTTTCTCTGTCTTCCATGTCTTGAGATC 
TCAGAGAATAATAATAAAAATGTTACTTTATAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 183 



</usr/seqdb2/sst/DNA/Dnaseqs ,min/ss.DNA56965 
<subunit 1 of 1, 175 aa, I stop 
<MW: 19330, pi: 7.25, NX(S/T): 1 

MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWM 
DADLACQKRPSGKLVSVLSGAEGSFVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGWEWSS 
TDVMNYFAWEKNPST I LNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD 

Important features: 
Signal peptide: 

amino acids 1-26 

C-type lectin domain signature. 

amino acids 146-171 



FIGURE 184 



CCAGTCTGTCGCCACCTCACTTGGTGTCTGCTGTCCCCGCCAGGCAAGCCTGGGGTGAGAGC 
ACAGAGGAGTGGGCCGGGACC ATG CGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGCTGGC 
TGCCTGCGGAGAGCTGGCGCCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAGGAGTGT 
CGGACTGTGTCACCATCGCCACCTGCACCACCAACGAAACCATGTGCAAGACCACACTCTAC 
TCCCGGGAGATAGTGTACCCCTTCCAGGGGGACTCCACGGTGACCAAGTCCTGTGCCAGCAA 
GTGTAAGCCCTCGGATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCTGCTGCAATACTG 
AGCTGTGCAATGTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTC 
CTCCCACTCTTGAGCCTCCGACTGTAGAGTCCCCGCCCACCCCCATGGCCCTATGCGGCCCA 
GCCCCGAATGCCTTGAAGAAGTGCCCCCTGCACCAGGAAAAAAAAAAAAAAAAA 



FIGURE 185 

< /us r / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 6405 
<subunit 1 of 1, 125 aa, 1 stop 
<MW: 13115, pi: 5.90, NX(S/T): 1 

MRGTRLALrjyLVLAACGELAPALRCYVCPEPTGVSDCVTIATCTTNETMCKTTLYSREIVYP 
FQGDSTVTKSCASKCKPSDVDGIGQTLPVSCCOTELCl^GAPALNSLHCGALTLLPLLSLRL 

Important features: 
Signal peptide: 

amino acids 1-17 

N-glycosylation site. 

amino acids 46-49 



FIGURE 186 

CTGCAGTCAGGACTCTGGGACCGCAGJGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGC 
ACGGTTTCGTGGGGACCCAGGCTTG<rAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTT 
GAGTCCTTCTGAGATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGG 
TAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTC 
AATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTC 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ACTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTT 
CTGATCAAAATCATTTCCGAGGAGAi\ATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 
CATAGCACCTTGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAA 
AGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTA 
GACACTTCTGGTCCAAGATCTGTAA?i.CCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCAT 
AGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTC 
TTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGA 
GACACTAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACC 
TTTTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCAT 
TCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTG 
TGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGA 
AACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTT 
TTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATT 
TCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCA 
AGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATT 
TTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGA 
AATAAAATTTAACATTTAAAAAAAAAAAAA 



FIGURE 187 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA57530 
<subunit 1 of 1, 266 aa, 1 stop 
<MW: 28672, pi: 8.85, NX(S/T) : 1 

MMALGAAGATRVFVAMVAAALGGHPLLGVSATLNSVLNSNAIKNLPPPLGGAAGHPGSAVSA 
APGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQICLACRKRRKRCMRH 
AMCCPGNYCKNGI CVS SDQNHFRGE I EET I TESFGNDHSTLDGYSRRTTLS SKMYHTKGQEG 
SVCLRSSDCAS GLCCARHFWSKI CKPVLKEGQVCTKHRRKGSHGLE I FQRCYCGEGLS CRIQ 
KDHHQASNSSRLHTCQRH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 256-259 

Fungal Zn(2)-Cys(6) binuclear cluster domain 

amino acids 110-126 



FIGURE 188 



TGTGTTTCCCTGCAGTCAGAATTTGSGACNGCAGGGGTTCCCGGACCTGATTTTGCAGCGGA 
ACGGGAAGGTTTTGTGGGACCCAGGTTGAAATGACGGTCATTTTTTTTTCTTTCTCCTTCNG 



GTAGCGGCGGTTTTCGGCGGCCACCTTNTGCTGGGAGTGAGCGCCACCTTGAATCGGTTTTC 
AATTCCAACGNTATCAAGAACCTGCCCCCACCGNTGGGCGGCGCTGCGGGGCACCCAGGNTT 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ATTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGANGCGGGCGTGCAAATNTGTNTNGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCANGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTNTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 
CATAGCACCTTGGATGGG 



F3CGURE 18 



gaggaacctaccggtaccggccgcgcgctggtagtcgccggtgtggctgcacctcaccaatcccgtgcgccgcgg 
ctgggccgtcggagagtgcgtgtgcttctct:cctgcacgcggtgcttgggctcggccaggcggggtccgccgcca 
gggtttgaggatgggggagtagctacaggaagcgaccccgcgat^ 

agtattagaaatgagctgaagacc^ttc^c^gattaatatttttggggacagatttgtgatgcttgattcaccct 
tgaagtaatgtagacagaagttctgaaatttgcatattacatcaa 

ttaaatc^gaacttgcataagaaagagaatgggagtctggttaaataaagatgactatatcagagacttgaaaag 
gatcattctctgttttctgatagtgtatatggccattttagtgggcacagatcaggatttttacagtttacttgg 
agtgtccaaaactgcaagcagtagagaaataagacaagcttt 

aaacccgaataacccaaatgcacatggcgattttttaaaaataaatagagcatatgaagtactcaaagatgaa.ga 
tctacggaaaaagtatgacaaatatggaga?kaagggacttgaggataatcaaggtggccagtatgaaagctggaa 
ctattatcgttatgattttggtatttatgatgatgatcctgaaatcataacattggaaagaagagaatttgatgc 
tgctgttaattctggagaactgtggtttgtaaatttttactccccaggctgttcacactgccatgatttagctcc 
cacatggagagactttgctaaagaagtggatgggttacttcgaattggagctgttaactgtggtgatgatagaat 

GCTTTGCCGAATGAAAGGAGTCAACAGCTATCCCAGTCTCTTCATTTTTCGGTCTGGAATGGCCCCAGTGAAATA 
TCATGGAGACAGATCAAAGGAGAGTTTAGTGAGTTTTGCAATGCAGCATGTTAGAAGTACAGTGACAGAACTTTG 
GACAGGAAATTTTGTCAACTCCATACAAAC1?GCTTTTGCTGCTGGTATTGGCTGGCTGATCACTTTTTGTTCAAA 
AGGAGGAGATTGTTTGACTTCACAGACACG/iCTCAGGCTTAGTGGCATGTTGTTTCTCAACTCATTGGATGCTAA 
AGAAATATATTTGGAAGTAATACATAATCTTCCAGATTTTGAACTACTTTCGGCAAACACACTAGAGGATCGTTT 
GGCTCATCATCGGTGGCTGTTATTTTTTCATTTTGGAAAAAATGAAAATTCAAATGATCCTGAGCTGAAAAAACT 
AAAAACTCTACTTAAAAATGATCATATTCAAGTTGGCAGGTTTGACTGTTCCTCTGCACCAGACATCTGTAGTAA 
TCTGTATGTTTTTCAGCCGTCTCTAGCAGTATTTAAAGGAC^GGAACCAAAGAATATGAAATTCATCATGGAAA 
GAAGATTCTATATGATATACTTGCCTTTGCCIAAAGAAAGTGTGAATTCTCATGTTACCACGCTTGGACCTCAAAA 
TTTTCCTGCC^ATGACAAAGAACCATGGCTTCTTGATTTCTTTGCCCCCTGGTGTCCACCATGTCGAGCTTTACT 
ACCAGAGTTACGAAGAGCATCAAATCTTCT'iPTATGGTCAGCTTAAGTTTGGTACACTAGATTGTACAGTTCATGA 
GGGACTCTGTAACATGTATAACATTCAGGCTTATCCAACAACAGTGGTATTCAACCAGTCCAACATTCATGAGTA 
TGAAGGACATGACTCTGCTGAACAAATCTTGGAGTTGATAGAGGATCTTATGAATCCTTCAGTGGTCTCCCTTAC 
ACCCACCACCTTCAACGAACTAGTTACACAAAGAAAACAC^^ 

GTGTCATCCTTGCCAAGTCTTAATGCCAGA?iTGGAAAAGAATGGCCCGGACATTAACTGGACTGATCAACGTGGG 
CAGTATAGATTGCCAACAGTATCATTCTTTTTGTGCCCAGGAAAACGTTCAAAGATACCCTGAGATAAGATTTTT 
TCCCCC^AAATCAAATAAAGCTTATC^GTATCAC^^ 

CTGGGGTCTAGGATTTTTACCTCAAGTATCC^CAGATCTAACACCTC^GACTTTC^GTGAAAAAGTTCTAC^GG 
GAAAAATCaTTGGGTGATTGATTTCTATGCTCCTTGGTGTGGACCTTGCCAGAATTTTGCTCCAGAATTTGAGCT 
CTTGGCTAGGATGATTAAAGGAAAAGTGAAJiGCTGGAAAAGTAGACTGTCAGGCTTATGCTCAGACATGCCAGAA 
AGCTGGGATCAGGGCCTATCCAACTGTTAAGTTTTATTTCTACGAAAGAGCAAAGAGAAATTTTCAAGAAGAGCSV 
GATAAATACCAGAGATGCAAAAGCAATCGCTGCCTTAATAAGTGAAAAATTGGAAACTCTCCGAAATCAAGGCAA 
GAGGAATAAGGATGAACTTTGATAATGTTG/VAGATGAAGAAAAAGTTTAAAAGAAATTCTGACAGATGACATCAG 
AAGACACCTATTTAGAATGTTACATTTATGATGGGAATGAATGAACATTATCTTAGACTTGCAGTTGTACTGCCA 
GAATTATCTACAGCACTGGTGTAAAAGAAGGGTCTGCAAACTTTTTCTGTAAAGGGCCGGTTTATAAATATTTTA 
GACTTTGCAGGCTATAATATATGGTTCACIACATGAGAACAAGAATAGAGTCATCATGTATTCTTTGTTATTTGCT 
TTTAACAACCTTTAAAAAATATTAAAACGATTCTTAGCT 

GACCATAGATTGCTGTCCCCCTCGACGGACTTATAATGTTTCAGGTGGCTGGCTTGAACATGAGTCTGCTGTGCT 

ATCTACaTAAATGTCTAAGTTGTATAAAGTCCACTTTCCCTTCACGTTTTTTGGCTGACCTGAAAAGAGGTAACT 

TAGTTTTTGGTCACTTGTTCTCCTAAAAATGCTATCCCTAACCATATATTTATATTTCGTTTTAAAAACACCCAT 

GATGTGGCACAGTAAACS^CCCTGTTATGCTGTATTATTATGAGGAGATTCTTC^TTGTTTTCTTTCCTTCTC^ 

AAGGTTGAAAAAATGCTTTTAATTTTTCACAGCCGAGAAACAGTGCAGCAGTATATGTGCACACAGTAAGTACAC 

AAATTTGAGC^CAGTAAGTGCACAAATTCTGTAGTTTGCTGTATCATCCAGGAAAACCTGAGGGAAA 

TAGCAATTAACTGGGCATTGTAGAGTATCCTAAATATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATA 

TGTGTTCATGTATTTTCTGAAATTGCTTTCATAGAAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTAATAT 

TTACATATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTTTTCTTTCATAGTTTTGG 

AATGATACTGTAGTTATTCCAGTTACTAGTTTACTGTCAGAGGGCTGCCTTTTTC^GATAAATATTGACATAATA 
ACTGAAGTTATTTTTATAAGAAAATCAAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGTTTAGA 
CTCAAAGAATCACAAATTTGTCAGTAACATGTAGTTGTTTAGTTATAATTCAGAGTGTACAGAATGGTAAAAATT 
CCAATCAGTCAAAAGAGGTCAATGAATTAA?AGGCTTGCAACTTTTTCAAAAAAAAAAAAAAAAA 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56439 
<subunit 1 of 1, 747 aa, 1 stop 
<MW: 86127, pi: 7.46, NX(S/T): 2 

MGWmKDDYIRDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKIJUjKL 
HPDKNPKnSTPNAHGDFLKINRAYEVKKDEDLRKKYDKY'GEKGLEDNQGGQYESWNYYRYDFGI 
YDDDPEIITLERREFDAAVNSGELWFVNFYSPGCSHCHDLAPTWRDFAKEVDGLLRIGAVNC 
GDDRMLCRMKGVNSYPSLFIFRSGMAPVKYHGDRSKESLVSFAMQHVRSTVTELWTGNFVNS 
IQTAFAAGIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHNLPDFELLSAN 
TLEDRLAHHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQP 
SLAVFKGQGTKEYEIHHGKKILYDir^FAKESVNSHVTTLGPQNFPANDKEPWLVDFFAPWC 
PPCRALLPELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTTWFNQSNIHEYEGHHS 
AEQILEFIEDLMNPSWSLTPTTFNELVTQRKHNEVWMVDFYSPWCHPCQVLMPEWKRMART 
LTGLINVGSIDCQQYHSFCAQENVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLG 
FLPQVSTDLTPQTFSEKVLQGKNHW-^IDFYAPWCGPCQNFAPEFELLARMIKGKVKAGKVDC 
QAYAQTCQKAGIRAYPTVKFYFYERMCRNFQEEQINTR^ 

Important features: 

Endoplasmic reticulum targeting sequence. 

amino acids 744-747 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature. 

amino acids 77-96 

N-glycosylation site. 

amino acids 484-487 



FIGURE 191 

AGACAGTACCTCCTCCCTAGGACTACACAAGGACTGAACCAGAAGGAAGAGGACAGAGCAAA 
GCCATGAACATCATCCTAGAAATCCTTCTGCTTCTGATCACCATCATCTACTCCTACTTGGA 
GTCGTTGGTGAAGTTTTTCATTCCTCAGAGGAGAAAATCTGTGGCTGGGGAGATTGTTCTCA 
TTACTGGAGCTGGGCATGGAATAGGCAGGCAGACTACTTATGAATTTGCAAAACGACAGAGC 
ATATTGGTTCTGTGGGATATTAATAAGCGCGGTGTGGAGGAAACTGCAGCTGAGTGCCGAAA 
ACTAGGCGTCACTGCGCATGCGTATGTGGTAGACTGCAGCAACAGAGAAGAGATCTATCGCT 
CTCTAAATCAGGTGAAGAAAGAAGTGGGTGATGTAACAATCGTGGTGAATAATGCTGGGACA 
GTATATCCAGCCGATCTTCTCAGCACCAAGGATGAAGAGATTACCAAGACATTTGAGGTCAA 
CATCCTAGGACATTTTTGGATCACAi^AAGCACTTCTTCCATCGATGATGGAGAGAAATCATG 
GCCACATCGTCACAGTGGCTTCAGTGTGCGGCCACGAAGGGATTCCTTACCTCATCCCATAT 
TGTTCCAGCAAATTTGCCGCTGTTGGCTTTCACAGAGGTCTGACATCAGAACTTCAGGCCTT 
GGGAAAAACTGGTATCAAAACCTCATGTCTCTGCCCAGTTTTTGTGAATACTGGGTTCACCA 
AAAATCCAAGCACAAGATTATGGCCTGTATTGGAGACAGATGAAGTCGTAAGAAGTCTGATA 
GATGGAATACTTACCAATAAGAAAATGATTTTTGTTCCATCGTATATCAATATCTTTCTGAG 
ACTACAGAAGTTTCTTCCTGAACGCGCCTCAGCGATTTTAAATCGTATGCAGAATATTCAAT 
TTGAAGCAGTGGTTGGCCACAAAATCAAAATGAAATGAATAAATAAGCTCCAGCCAGAGATG 
TATGCATGATAATGATATGAATAGTTTCGAATCAATGCTGCAAAGCTTTATTTCACATTTTT 
TCAGTCCTGATAATATTAAAAACATTGGTTTGGCACTAGCAGCAGTCAAACGAACAAGATTA 
ATTACCTGTCTTCCTGTTTCTCAAGAATATTTACGTAGTTTTTCATAGGTCTGTTTTTCCTT 
TCATGCCTCTTAAAAACTTCTGTGCTTACATAAACATACTTAAAAGGTTTTCTTTAAGATAT 
TTTATTTTTCCATTTAAAGGTGGACAAAAGCTACCTCCCTAAAAGTAAATACAAAGAGAACT 
TATTTACACAGGGAAGGTTTAAGACTGTTCAAGTAGCATTCCAATCTGTAGCCATGCCACAG 
AATATCAACAAGAACACAGAATGAGTGCACAGCTAAGAGATCAAGTTTCAGCAGGCAGCTTT 
ATCTCAACCTGGACATATTTTAAGATTCAGCATTTGAAAGATTTCCCTAGCCTCTTCCTTTT 
TCATTAGCCCAAAACGGTGCAACTCTATTCTGGACTTTATTACTTGATTCTGTCTTCTGTAT 
AACTCTGAAGTCCACCAAAAGTGGACCCTCTATATTTCCTCCCTTTTTATAGTCTTATAAGA 
TACATTATGAAAGGTGACCGACTCTATTTTAAATCTCAGAATTTTAAGTTCTAGCCCCATGA 
TAACCTTTTTCTTTGTAATTTATGCTTTCATATATCCTTGGTCCCAGAGATGTTTAGACAAT 
TTTAGGCTCAAAAATTAAAGCTAACACAGGAAAAGGAACTGTACTGGCTATTACATAAGAAA 
CAATGGACCCAAGAGAAGAA 



FIGURE 192 

</usr/segdb2/sst/DNA/Dnaseqs.min/ss.DNA56409 
<subunit 1 of 1, 3 00 aa, 1. stop 
<MW: 33655, pi: 9.31, NX(S/T): 1 

MNIILEILLLLITIIYSYLESLVKFFIPQRRKSVAGEIVLITGAGHGIGRQTTYEFAKRQSI 
LVLWD I NKRGVEETAAECRKLGVTAHAYWDCSNREE I YRS LNQVKKE VGDVT I WNNAGTV 
YPADLLSTKDEEITKTFEVNILGHFWITKALLPSMMERNHGHIVTVASVCGHEGIPYLIPYC 
S S KFAAVGFHRGLTS ELQALGKTGI KTSCLCPVFVNTGFTKNPSTRLWPVLETDEWRSL ID 
GI LTNKKMI FVPSYINI FLRLQKFLPERASAI LNRMQNIQFEAWGHKI KMK 

Important features: 
Signal peptide: 

amino acids 1-19 

■1 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 30-33 and 58-61 

Short-chain alcohol dehydrogenase family protein 

amino acids 165-202, 37-49, 112-122 and 210-219 



FIGURE 193 



CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGG 
ATGCCCCGGCTCTGCAGGAAGCTGAAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGC 
AGGATGACCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCT 
GCTGATCATCGTGTACTGGGACAGCGCAGGCGCCGCGCACTTCTACTTGCACACGTCCTTCT 
CTAGGCCGCACACGGGGCCGCCGCI'GCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACG 
GCCGACTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGA 
CCTTCCCAGAAAGGAGACGGAGCAGCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAG 
GCTACGACTGGTCCCCGCGCGACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAG 
CGGAGGAGCGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCG 
CGCATTCGACGACATCCCCAACTCGGAGCTGAGCCACCTGATCGTGGACGACCGGCACGGGG 
CCATCTACTGCTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTG 
AGCGGAAGCCTGCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCA 
CGTGCACAACGCCAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCT 
CCCGCCACCTCATGAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTCGTGCGCGACCCC 
TTCGTGCGCCTGATCTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCG 
CAAGTTCGCCGTGCCCATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGC 
GCGAGGCCTTCCGCGCTGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGAC 
CCGCACACGGAGAAGCTGGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCA 
CCCGTGCCAGATCGACTACGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGACGCCGCGC 
AGCTGCTGCAGCTACTCCAGGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGG 
ACCGCCAGCAGCTGGGAGGAGGACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAGCT 
GTATAAACTCTACGAGGCCGACTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCC 
GAGACTGAAAGCTTTCGCGTTGCTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCC 
AGTTTTTTTATGACCTACGATTTTG(!IAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCC 
ATTGAGTACTGTATCGATATTGTTTTTTAAGATTAATATATTTCAGGTATTTAATACGA 
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< /us r / seqdb2 / s s t /DNA/Dnaseqs . min/ s s - DNA5 6112 
<subunit 1 of 1, 414 aa, 1 stop 
<MW: 48414, pi: 9.54, NX(S/T): 4 

MTKARLFRLWLVLGSVFMILLIIVYWDSAGAAHFYLHTSFSRPHTGPPLPTPGPDRDRELTA 
DSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAER 
RSVLRGFCANSSLAFPTKERAFDDIPNSELSHLIVDDRHGAIYCYVPKVACTNWKRVMIVLS 
GSLLHRGAPYRDPLRIPREHVHNASAHLTFNKFWRRYGKLSRHLMKVKLKKYTKFLFVRDPF 
VRLISAFRSKFELENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDP 
HTEKLAPFNEHWRQVYRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRT 
ASSWEEDWFAKIPIAWRQQLYKLYEADFVLFGYPKPENLLRD 

Important features: 
Signal peptide: 

amino acids 1-31 

N-glycosylation sites. 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cysteine -rich region protein 

amino acids 329-332 
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TCGGGCCAGAATTCGGCACGAGGCGGCACGAGGGCGACGGCCTCACGGGGCTTTGGAGGTGA 
AAGAGGCCCAGAGTAGAGAGAGAGAGAGACCGACGTACACGGGATGGCTACGGGAACGCGCT 
ATGCCGGGAAGGTGGTGGTCGTGACCGGGGGCGGGCGCGGCATCGGAGCTGGGATCGTGCGC 
GCCTTCGTGAACAGCGGGGCCCGAGTGGTTATCTGCGACAAGGATGAGTCTGGGGGCCGGGC 
CCTGGAGCAGGAGCTCCCTGGAGCTGTCTTTATCCTCTGTGATGTGACTCAGGAAGATGATG 
TGAAGACCCTGGTTTCTGAGACCATCCGCCGATTTGGCCGCCTGGATTGTGTTGTCAACAAC 
GCTGGCCACCACCCACCCCCACAGAGGCCTGAGGAGACCTCTGCCCAGGGATTCCGCCAGCT 
GCTGGAGCTGAACCTACTGGGGACGTACACCTTGACCAAGCTCGCCCTCCCCTACCTGCGGA 
AGAGTCAAGGGAATGTCATCAACATCTCCAGCCTGGTGGGGGCAATCGGCCAGGCCCAGGCA 
GTTCCCTATGTGGCCACCAAGGGGGCAGTAACAGCCATGACCAAAGCTTTGGCCCTGGATGA 
AAGTCCATATGGTGTCCGAGTCAACTGTATCTCCCCAGGAAACATCTGGACCCCGCTGTGGG 
AGGAGCTGGCAGCCTTAATGCCAGACCCTAGGGCCACAATCCGAGAGGGCATGCTGGCCCAG 
CCACTGGGCCGCATGGGCCAGCCCGCTGAGGTCGGGGCTGCGGCAGTGTTCCTGGCCTCCGA 
AGCCAACTTCTGCACGGGCATTGAACTGCTCGTGACGGGGGGTGCAGAGCTGGGGTACGGGT 
GCAAGGCCAGTCGGAGCACCCCCGTGGACGCCCCCGATATCCCTTC CTGAT TTCTCTCATTT 
CTACTTGGGGCCCCCTTCCTAGGACTCTCCCACCCCAAACTCCAACCTGTATCAGATGCAGC 
CCCCAAGCCCTTAGACTCTAAGCCCAGTTAGCAAGGTGCCGGGTCACCCTGCAGGTTCCCAT 
AAAAACGATTTGCAGCC 
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</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56045 
<subunit 1 of 1, 270 aa, 1 stop 
<MW: 28317, pi: 6.00, NX(S/T): 1 

MATGTRYAGKVVVVTGGGRG I GAGI VRAFVNSGARWT CDKDESGGRALEQE LPGAVF I LCD 
VTQEDDVKTLVSETIRRFGRLDCVVNNAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTKL 
ALPYLRKSQGNVINISSLVGAIGQAQAVPYVATKGAVTAMTKALALDESPYGVRVNCISPGN 
IWTPLWEELAALMPDPRATIREGMLi^QPLGRMGQPAEVGAAAVFLASEANFCTGIELLVTGG 
AELGYGCKASRSTPVDAPDIPS 

Important features: 
N-glycosylation site. 

amino acids 13 8-141 

Short -chain alcohol dehydrogenase family protein 

amino acids 10-22, 81-91, 134-171 and 176-185 
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AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGAATGGACTGGCCTCACAACCTG 
CTGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAAAAGCAA 
GAGGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACC 
TGGTGTCACGGATGAAACCGTATGCCCGCATGGAGGAGTATGAGAGGAACATCGAGGAGATG 
GTGGCCCAGCTGAGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCT 
GTGGATGTCCAACAAGAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCC 
GTATCCCCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACC 
ATGCAGGAGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCG 
CCTCTGCCCGCCACCGCCCCGCACAC3GGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCG 
CTGTGGGCTGCACCTGCATCTTC TGAA TCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGA 
CCATCCTCCTTGCACCTTTGTGCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAA 
GCAAG 



FIGURE 198 

</usr/seqdb2/sst/DNA/Dnase5qs .min/ss .DNA59294 
<subunit 1 of 1, 180 aa, 1 stop 
<MW: 20437, pi: 9.58, NX(S/T): 1 

MDWPHNLLFLLT I S I FLGLGQPRS PKS KRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEE Y 
ERNI EEMVAQLRNS S ELAQRKCEVNLQLWMSNKRSLS PWGYS INHDPSRI PVDLPEARCLCL 
GCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 75-78 

Homologous region to IL-T.7 

amino acids 96-180. 
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GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGG 
CGAGCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCG 
GCGCCCAACATGGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGAT 
CGCGGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGG 
TCCAGCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGATGCTGAAATTT 
TACGCCCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGCTTTTGCAAAGAA 
TGGTGAAATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTG 
GCCGCTTCTTTGTCACCACTCTCCCAGCATTTTTTCATGCAAAGGATGGGATATTCCGCCGT 
TATCGTGGCCCAGGAATCTTCGAAGACCTGCAGAATTATATCTTAGAGAAGAAATGGCAATC 
AGTCGAGCCTCTGACTGGCTGGAAA.TCCCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTC 
TTTTTAGCATCTCTGGCAAGATATGGCATCTTCACAACTATTTCACAGTGACTCTTGGAATT 
CCTGCTTGGTGTTCTTATGTGTTTTTCGTCATAGCCACCTTGGTTTTTGGCCTTTTTATGGG 
TCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGTGCCACTTCCAAGGCATTTATCTGAGC 
GTTCTGAGCAGAATCGGAGATCAGAGGAGGCTCATAGAGCTGAACAGTTGCAGGATGCGGAG 
GAGGAAAAAGATGATTCAAATGAAG.AAGAAAACAAAGACAGCCTTGTAGATGATGAAGAAGA 
GAAAGAAGATCTTGGCGATGAGGATGAAGCAGAGGAAGAAGAGGAGGAGGACAACTTGGCTG 
CTGGTGTGGATGAGGAGAGAAGTGAGGCCAATGATCAGGGGCCCCCAGGAGAGGACGGTGTG 
ACCCGGGAGGAAGTAGAGCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGC 
TGACACAGAGGTGGTGGAAGACTCCTTGAGGCAGCGTAAAAGTCAGCATGCTGACAAGGGAC 
TGTAGATTTAATGATGCGTTTTCAAGAATACACACCAAAACAATATGTCAGCTTCCCTTTGG 
CCTGCAGTTTGTACCAAATCCTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCT 
CTCTAGTCATTTGGTCTCATGGCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGT 
GACAATCAGGATATAGAAAAACAAACGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAA 
CAAGTTCATTTACTTAGGGGTCAGAGAGTCTCGACCAGAGGAGGCCATTCCCAGTCCTAATC 
AGCACCTTCCAGAGACAAGGCTGCAGGCCCTGTGAAATGAAAGCCAAGCAGGAGCCTTGGCT 
CCTGAGCATCCCCAAAGTGTAACGTAGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTAT 
TTTTGTCAAATTGCAGGAAACATCAGGCACCACAGTGCATGAAAAATCTTTCACAGCTAGAA 
ATTGAAAGGGCCTTGGGTATAGAGAGCAGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTG 
TGCTATGTTTTATTTCTTACCTTTAATTTTTCCAGCATTTCCACCATGGGCATTGAGGCTCT 
CCACACTCTTCACTATTATCTCTTGGTCAGAGGACTCCAATAACAGCCAGGTTTACATGAAC 
TGTGTTTGTTCATTCTGACCTAAGGGGTTTAGATAATCAGTAACCATAACCCCTGAAGCTGT 
GACTGCCAAACATCTCAAATGAAATGTTGTGGCCATCAGAGACTCAAAAGGAAGTAAGGATT 
TTACAAGACAGATTAAAAAAAAATTGTTTTGTCCAAAATATAGTTGTTGTTGATTTTTTTTT 
AAGTTTTCTAAGCAATATTTTTCAAGCCAGAAGTCCTCTAAGTCTTGCCAGTACAAGGTAGT 
CTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATCT C AAGGGGTTC CCTGGGTCTTGAAC 
TACTTTAATAATAACTAAAAAACCACTTCTGATTTTCCTTCAGTGATGTGCTTTTGGTGAAA 
GAATTAATGAACTCCAGTACCTGAAAGTGAAAGATTTGATTTTGTTTCCATCTTCTGTAATC 
TTCCAAAGAATTATATCTTTGTAAATCTCTCAATACTCAATCTACTGTAAGTACCCAGGGAG 
GCTAATTTCTTT 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56433 
<subunit 1 of 1, 349 aa, 1 stop 
<MW: 38952, pi: 4.34, NX(S/T): 1 

MAGGRCGPQLTALLAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWMLKFYAP 
WCPSCQQTDSEWEAFAKNGEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFRRYRG 
PGIFEDLQNYILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFTVTLGIPAW 
CSYVFFVIATLVFGLFMGLVLWISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEK 
DDSNEEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQGPPGEDGVTRE 
EVEPEEAEEGISEQPCPADTEWEDSLRQRKSQHADKGL 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 191-211 

N-glycosylation site. 

amino acids 46-49 

Thioredoxin family proteins. (homologous region to disulfide 

isomerase) 

amino acids 56-72 



Flavodoxin proteins 

amino acids 173-187 
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ATCTGGTTGAACTACTTAAGCTTAATTTGTTAAACTCCGGTAAGTACCTAGCCCACATGATT 
TGACTCAGAGATTCTCTTTTGTCCACAGACAGTCATCTCAGGGGCAGAAAGAAAAGAGCTCC 
CAAATGCTATATCTATTCAGGGGCTCTCAAGAACA ATGG AATATCATCCTGATTTAGAAAAT 
TTGGATGAAGATGGATATACTCAATTACACTTCGACTCTCAAAGCAATACCAGGATAGCTGT 
TGTTTCAGAGAAAGGATCGTGTGCTGCATCTCCTCCTTGGCGCCTCATTGCTGTAATTTTGG 
GAATCCTATGCTTGGTAATACTGGTGATAGCTGTGGTCCTGGGTACCATGGGGGTTCTTTCC 
AGCCCTTGTCCTCCTAATTGGATTATATATGAGAAGAGCTGTTATCTATTCAGCATGTCACT 
AAATTCCTGGGATGGAAGTAAAAGAC!AATGCTGGCAACTGGGCTCTAATCTCCTAAAGATAG 
ACAGCTCAAATGAATTGGGATTTATAGTAAAACAAGTGTCTTCCCAACCTGATAATTCATTT 
TGGATAGGCCTTTCTCGGCCCCAGACTGAGGTACCATGGCTCTGGGAGGATGGATCAACATT 
CTCTTCTAACTTATTTCAGATCAGAACCACAGCTACCCAAGAAAACCCATCTCCAAATTGTG 
TATGGATTCACGTGTCAGTCATTTATGACCAACTGTGTAGTGTGCCCTCATATAGTATTTGT 
GAGAAGAAGTTTTCAATGTAAGAGGAAGGGTGGAGAAGGAGAGAGAAATATGTGAGGTAGTA 
AGGAGGACAGAAAACAGAACAGAAAAGAGTAACAGCTGAGGTCAAGATAAATGCAGAAAATG 
TTTAGAGAGCTTGGCCAACTGTAATCTTAACCAAGAAATTGAAGGGAGAGGCTGTGATTTCT 
GTATTTGTCGACCTACAGGTAGGCTAGTATTATTTTTCTAGTTAGTAGATCCCTAGACATGG 
AATCAGGGCAGCCAAGCTTGAGTTTTTATTTTTTATTTATTTATTTTTTTGAGATAGGGTCT 
CACTTTGTTACCCAGGCTGGAGTGCAGTGGCACAATCTCGACTCACTGCAGCTATCTCTCGC 
CTCAGCCCCTCAAGTAGCTGGGACTACAGGTGCATGCCACCATGCCAGGCTAATTTTTGGTG 
TTTTTTGTAGAGACTGGGTTTTGCCATGTTGACCAAGCTGGTCTCTAACTCCTGGGCTTAAG 
TGATCTGCCCGCCTTGGCCTCCCAAi^GTGCTGGGATTACAGATGTGAGCCACCACACCTGGC 
CCCAAGCTTGAATTTTCATTCTGCCATTGACTTGGCATTTACCTTGGGTAAGCCATAAGCGA 
ATCTTAATTTCTGGCTCTATCAGAGTTGTTTCATGCTCAACAATGCCATTGAAGTGCACGGT 
GTGTTGCCACGATTTGACCCTCAACTTCTAGCAGTATATCAGTTATGAACTGAGGGTGAAAT 
ATATTTCTGAATAGCTAAATGAAGAi\ATGGGAAAAAATCTTCACCACAGTCAGAGCAATTTT 
ATTATTTTCATCAGTATGATCATAATTATGATTATCATCTTAGTAAAAAGCAGGAACTCCTA 
CTTTTTCTTTATCAATTAAATAGCTC!AGAGAGTACATCTGCCATATCTCTAATAGAATCTTT 

CACGATCTCGGCTCACCGCAACCTC<I!GCCCCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCT 

cccaagtagctgggattacagtcagcicaccaccacacccggctaattttgtatttttttagt 
agagacagggtttctccatgtcggtc:agggtagtcccgaactcctgacctcaagtgatctgc 
ctgcctcggcctcccaagtgctgggattacaggcgtgagccactgcacccagcctagaatct 

TGTATAATATGTAATTGTAGGGAAACTGCTCTCATAGGAAAGTTTTCTGCTTTTTAAATACA 
AAAATACATAAAAATACATAAAATCTGATGATGAATATAAAAAAGTAACCAACCTCATTGGA 
ACAAGTATTAACATTTTGGAATATGTTTTATTAGTTTTGTGATGTACTGTTTTACAATTTTT 
ACCATTTTTTTCAGTAATTACTGTAAAATGGTATTATTGGAATGAAACTATATTTCCTCATG 
TGCTGATTTGTCTTATTTTTTTCATACTTTCCCACTGGTGCTATTTTTATTTCCAATGGATA 
TTTCTGTATTACTAGGGAGGCATTTACAGTCCTCTAATGTTGATTAATATGTGAAAAGAAAT 
TGTACCAATTTTACTAAATTATGCAGTTTAAAATGGATGATTTTATGTTATGTGGATTTCAT 
TTCAATAAAAAAAAACTCTTATCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA53 912 
<subunit 1 of 1, 201 aa, 1 stop 
<MW: 22563, pi: 4.87, NX(S/T): 1 

MEYHPDLENLDEDGYTQLHFDSQSNTRIAWSEKGSCAASPPWRLIAVILGILCLVILVIAV 
VLGTMGVLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQ 
VSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQL 
CSVPS YS I CEKKFSM 

Important features: 

Type XI transmembrane domain: 

amino acids 45-65 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 197-200 

N-myristoylation sites. 

amino acids 35-40 and 151-156 

Homologous region to LDL receptor 

amino acids 34-67 and 70-200. 
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GGAAGGGGAGGAGCAGGCCACACAGGCACA'SGCCGGTGAGGGACCTGCCCAGACCTGGAGGGTCTCGCTCTGTCA 

C^CAGGCTGGAGTGCAGTGGTGTGATCTTGSCTC^TCGTAACCTCC^CCTCCCGGGTTC^GTGATTCTCATGCC 

TCAGCCTCCCGAGTAGCTGGGATTACAGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGAA AATG ACTCCCCAG 

TCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTC 

GAAGACTTTCGCTTCTGCAGCCaGCGGAACCAGAC^^ 

CGCATCTCCATCGAGAACTCCGAAGAGGCCCTC^^ 

TCCTTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGATTACATCTTCTC 

TATGGCAAGCGTGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCTG 

GCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTCACCTC 

GCC^GCTTCACCTTCTCCTTCC&C^GTCCTC^ 

AAAAGGGACCTCC^GCTGCTCAGCCAGTTCCTGAAGCATC 

GCCAGCCAGCAGTTGCAGAGCCTGGAGTCG.?!AACTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTTCGAG 

GAGGACCGGATCAACGCGA.CGGTGTGGAAGCTCCAGCCCACAGCCGGCCTCCAGGACCTGCACATCCACTCCCGG 

CAGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTGCTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGC 

CGGAGCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGACAAGAATTCC 

AGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACC^ 

GTGCTCACTTTCCAGCaCCAGCTACaGCCGAAGA^ 

TTGAGC^GCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGC 
AACCACTTGACCTACTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCAC^AGCACTACCTGAGC 
CTCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTCCAGG 
GTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTAC^CCAT^ 

CTGCTGGAGACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCC 

ATCTTCCTGCACTTCTCCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTG 

GTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTT 

CTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAG 

GGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGC 

CTGGTGTTTCTGTTCAAC^TGGCCATGCTA(3CCACCATGGTGGTGCaGATCCTGCGGCTGCGCCCCCACACCC^ 

AAGTGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTTCTCC 

TTTGCTTCTGGCACCTTCCAGCTTGTCGTCCT^^ 

TTCATCTGGTACTGGTCC^TGCGGCTGCAGGC^^ 

AGGCTCCCCATCAGCTCGGGCAGCACCTCGTCCAGCCGCAT CTAGG CCTCCAGCCCACCTGCCCAT 

CAGAGATGCGGCCTCGTCGCACACTGCCTGTGGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGCAGACT 

TTGGAAAGCCCAACGACCATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCCGGGCTGGGCTTTTGAATTG 

GCCTTGGGGACTACTCGGCTCTCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTG 

TCCCCACATCTGTCCCAACCCAGCTGGAGGCCTGGTCTCT^ 

GGCCAGGCCTTGGATCTTGAGGGTCTGGCACATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGTGGCTCCA 
GTTGCTCTGTCTCTCGTGGTCACCCTGAGG(jCACTCTGCATCCTCTGTCATTTTAACCTGAGGTGGCACCCAGGG 

cgaatggggcccagggcagaccttcagggcfzagagccctggcggaggagaggccctttgccaggagcacagcagc 
agctcgcctacctctgagcccaggccccctccctccctcagccccccagtcctccctccatcttccctggggttc 

gtggtttccaggagctgcctggtgtctgctgtaaatgtttgtctactgcacaagcctcggcctgcccctgagcca 
ggctcggtaccgatgcgtgggctgggctagc3tccctctgtccatctgggcctttgtatgagctgcattgcccttg 
ctcaccctgaccaagcacacgcctcagagg<3gccctcagcctctcctgaagccctcttgtggcaagaactgtgga 
ccatgccagtcccgtctggtttccatcccac^ 

gagcctgacactctcctaagaggttctctccaagcccccaaatagctccaggcgccctcggccgcccatcatggt 
taattctgtccaacaaacacacacgggtagattgctggcctgttgtaggtggtagggacacagatgaccgacctg 
gtcactcctcctgccaacattcagtctggtatgtgaggcgtgcgtgaagcaagaactcctggagctacagggac^ 
gggagcgktcattcctgcctgggaatcctgga^ 

GGGAAGGATGTTCTTTTTACGTACC^TTC^t'TTTGTCTTTTGATATTAAAAAGAAGTAC^TGTTCATTGTAGAGA 
ATTTGGAAACTGTAGAAGAGAATCAAGAAGAAAAATAAAAATCAGCTGTTGTAATCGCCTAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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< /usr / seqdb2 / s s t /DNA/Dnaseqs . min/ ss . DNA5 0921 
<subunit 1 of 1, 693 aa, 1 stop 
<MW: 77738, pi: 8.87, NX(S/T): 7 

MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSE 
EALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQH 
QEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLL 
SQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKLQPTAGLQD 
LHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGE 
KVLGIWQNTKVANLTEPVVLTFQHQLQPKNVTLQCVFWVEDPTLSSPGHWSSAGCETVRRE 
TQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCWSALACLVTIAAyLCSRVPLPC 
RRKPRDYTIKVHMNLLLAVFLLDTS FLLS E PVALTGSEAGCRASAI FLHFSLLTCLSWMGLE 
GYNLYRLWEVFGTYVPGYLLKLSAMGWGFP I FLVTLVALVDVDNYGP 1 1 LAVHRTPEGVIY 
PSMCWIRDSLVSYITNLGLFSLVFLF1MAMLATMVVQILRLRPHTQKWSHVLTLLGLSLVLG 
LPWALIFFSFASGTFQLWLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLP 
ISSGSTSSSRI 

Important features: 
Signal peptide: 

amino acids 1-25 

Putative transmembrane domains: 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C-terminal targeting signal. 

amino acids 691-693 

cAMP- and cGMP- dependent protein kinase phosphorylation sites, 
amino acids 198-201 and 370-373 
N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors family 2 proteins 

amino acids 475-504 
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TGCCTGGCCTGCCTTGTCAACAATGCCGCTTACTCTGCTTCCAGGTTGCCCTGCCTTGCAGA 
GGAAANCNTCGGGACTACACCNTCA!\GTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTG 
GACACGAGCTTCCTGCTCAGCGNAGCCGGTGGCCCTGACAGGCTCTGAAGGCTGGCTGCCGA 
GCCAGTGCCATCTTCCTGCACTTCTCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGG 
TACAACCTCTACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAA 
GCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATG 
TGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCT 
TCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCT 
GGTGTTTCTGTTCAACATGG 
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CGGACGCGTG<^CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTT(^GGTCCAGGTTTTGCTTTGA 
TCCTTTTCAAJU^CTGGAGAC^(^GAAGAG(3GCTCTAGGAAAAAGTTTTGGATGGGATTATGTGGAAACTACCCT 
GCGATTCTCTGCTGCCAGAGCAGGCTCGGCGCTTCCACCCCAGTGCAGCCTTCCCCTGGCGGTGGTGAAAGAGAC 
TCGGGAGTCGCTGCTTCCaAAGTGCCCGCCGTGAGTGAGCTCT^ 

TTCTCCTGCTGACATCTGCCCTGGCCGGCCi\GAGACAGGGGACTCAGGCGGAATCCAACCTGAGTAGTAAATTCC 
AGTTTTCCAGCAACAAGGAACAGAACGGAGTACAAGATCCTC 

GAAGTATTCAC^GCCCAAGGTTTCCTCATACTTATCCAAGAAATACGGTCTTGGTATGGAGATTAGTAGCAGTAG 
AGGAAAATGTATGGATACAACTTACGTTTGATGAAAGATTTGGGCTTGAAGACCCAGAAGATGACATATGCAAGT 
ATGATTTTGTAGAAGTTGAGGAACCCAGTGATGGAACTATATTAGGGCGCTGGTGTGGTTCTGGTACTGTACCAG 
GAAAACAGATTTCTAAAGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCAGGGT 
TCTGCATCC^CTACAACATTGTCATGCCaCAATTCACAGAAGCTGTGAGTCCTTCAGTGCTACCCCCTTCAGCT^ 
TGCCACTGGACCTGCTTAATAATGCTATAACTGCCTTTAGTACOTTGGAAGACCTTATTCGATATCTTGAACCAG 
AGAGATGGCAGTTGGACTTAGAAGATCTATATAGGCCAACTTGGCAACTTCTTGGCAAGGCTTTTGTTTTTGGAA 
GAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCAC^CCTCGTAACTTCT 
C^GTGTCCATAAGGGAAGAACTAAAGAGAACCGATAC<mTTTTCTGGCGaGGTTGTCTCCTGGTTAAACGCTGTG 
GTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACC 
ACGAGGTCCTTCAGTTGAGACCAAAGACCGCJTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGC 
ACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGG 

GAGCTGTGC^GTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTC<^TCCTTAATCTCaGTTGTTTGCr 
TCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAA^ 

ACAGCTCTTTTGAGAGGAGGCCTAAAGGAaiGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTAT 
TAAATAGATCACCAGCTAGTTTCAGAGTTACCaTGTACGTATTCCACTAGCTGGGTTCTGTATTTCaGTTCTTTC 
GATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAAC 
TCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCTCATATTCACAT 
ATGTAAACCAGAACATTCTATGTACTAGAAACCTGGTTTTTAA 

GTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCG&TTTAGAAGAAGAG 
TTCATGGTTTGGAAGAGATAAACCTGAAAAC4AAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTG 
TTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCT 
ATTTTTACCAAAGGTATTTAATATTCTTTTTTATGACAACT 

AAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTT 

TTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAA 

GACTTTTTGAAAATAATTAAATTATC^TATCTTCC^TTCCrGTTATTGGAGATGAAAATAAAAAGC^CTTATGA 

AAGTAGACATTCAGATCCAGCCATTACTAACCTATTC 

AAAGC^CCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCT 

TTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCC^TAC^CTTGTATAAATAC^TGGATATTTTTATGTAC^ 
GAAGTATGTCTCTTAACCAGTTCACTTATTGTACTCTGGC^^TTTAAAAGAAAATCAGTAAAATATTTTGCTTGT 
AAAATGCTTAATAITCGTGCCTAGGTTATGTG^ 

ATGTGGCTATTTTGGGGAGAAAATTAAAAAAAAAAAAAAAAAAAAAGGTTTAGGGATAACAGGGTAATGCGGCC 



FIGURE 207 

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGSIHSPR 
FPHTYPRNTVLVWRLVAVEENVW I QLTFDERFGLEDPEDD I CKYDFVEVEEP SDGT I LGRWC 
GSGTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLL 
NNAITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKBRWDLNLLTEEVRLY 
SCTPRNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCV'PSKVTKKYHEVLQ 
LRPKTGVRGLHKSLTDVALEHHEE CD CV CRGSTGG 



Signal sequence: 

amino acids 1-14 



CCCATCTCAAGCTGATCTTGGCACCTCTCATGCTCT 

AGACTAAA AATGG TGTTTCCAATGTGGACaCTGAAGAOACAAATTnTTAT^^TTTTTAAr^TaaTr'r'TaaTTTr'r^ 

aaactccttggggctagatggtttcctaaaivctctgccctgtgatgtc^ctctggatgttccaaagaaccatgtg 
atcgtggactgcacagacaagcatttgacagaaattcctggaggtattcccaggaacaccacgaacctcaccctc 
accattaaccacataccagacatctccc<^<3cgtcctttca(^gactggaccatctggtagagatcgatttcaga 
tgcaactgtgtacctattccactggggtcaaaa 

tttagtggactcacttatttaaaatccctttacctggatggaaaccagctactagagataccgcagggcctcccg 
cctagcttacagcttctcagccttgaggcg^caacatcttttccatcagaaaagagaatctaacagaactggcc 
aacatagaaatactctacctgggccaaaactgttattat^ 

gatgccttcctaaacttgacaaagttaaaagtgctctccctgaaagataacaatgtcacagccgtccctactgtt 
ttgcc^tctactttaacagaactatatctctac^ 

ctc^ccaattacaaattcttgacctaagtggaaattgccotcgttgttataatgccccatttccttgtgcgccg 
tgtaaaaataattctcccctacagatccctgtaaatgcttttgatgcgctgacagaattaaaagttttacgtcta 
cacagtaactctcttcagcatgtgccccca\gatggtttaagaacatcaacaaactccaggaactggatctgtcc 

tctttcaattttgaacttc^ggtctatcgtgcatctatgaatctatcacaagcattttcttcactgaaaagcctg 
aaaattctgcggatcagaggatatgtctttaaagagttgaaa 

aatcttgaagttcttgatcttggcactaactttataaaaattgctaacctcagcatgtttaaacaatttaaaaga 
ctgaaagtcatagatctttcagtgaataaaatatcaccttcaggagattcaagtgaagttggcttctgctcaaat 
gccagaacttctgtagaaagttatgaacccga£x3^ 
aggagttgcagattcaaaaacaaagaggcttctttcatgtctgttaa^ 

ttggatctaagtaaaaatagtatattttttgtcaagtcctctgattttcagcatctttctttcctcaaatgcctg 
aatctgtcaggaaatctcattagccaaactcttaa^ 
gacttctccaacaaccggcttgatttactccattcaac^ 
ataagcagtaatagcc^ttattttcaatcagaaggaattactc^ 

ctgcagaaactgatgatgaacgacaatgaai.tctcttcctccaccagcaggaccatggagagtgagtctcttaga 
actctggaattcagaggaaatcacttagatc^ 

aatctgctaarattagaggaattagacatctctaaaaattccctaagtttcttgccttctggagtttttgatggt 

atgcctccaaatctaaagaatctctctttggccaaaaatg 

ctaaagaacctggaaactttggacctcagcc^ca^ 

agaagcctc^gaatctgattcttaagaataatcaaatcaggagtctgacgaagtattttctacaagatgccttc 
cagttgcgatatctggatctcagctcaaataaaatccagatgatccaaaagaccagcttccgagaaaatgtcctc 
aacaatctgaagatgttgcttttgcatcataatcggtttctgtgcacctgtgatgctgtgtggtttgtctggtgg 
gttaaccatacggaggtgactattccttacc:tggccacagatgtgacttgtgtggggccaggagcacacaagggc 
caaagtgtgatctccctggatctgtacacctgtgagttagatctgactaacctgattctgttctcactttccata 
tctgtatctctctttctcatggtgatgatgacagcaagtcacctctatttctgggatgtgtggtatatttaccat 
ttctgtaaggccaagataaaggggtatcagcgtctaatatcacgagactgttgctatgatgcttttattgtgta^ 
gacactaaagacccagctgtgaccgagtgggjttttggctgagctggtggccaaactggaagacccaagagagaaa 
cattttaatttatgtctcgaggaaagggactg^ 

cagcttagcaaaaagacagtgtttgtgatgacagacaagtatgcaaagactgaaaattttaagatagc^ 

ttgtcccatcagaggctcatggatgaaaaagttgatgtgatoatcttgatatttcttgagaagccctttc^gaag 

tccaagttcctccagctccggaaaaggctcl'gtgggagttctgtccttgagtggccaacaaacccgcaagctcac 

ccatacttctggcagtgtctaaagaa.cgccc:tggccag^gacaatcatgtggcctatagt(^ggtgttcaagga^ 

acggtctagcccttctttgcaaaacacaac'i'gcctagtttaccaaggagaggcctggc 



FIGURE 209 



MVFPMWTLKRQILILFNI ILISKLLGARWFPKTLPCDVTLDVPKNHVIVDCTDKHLTEI PGG 
IPTNTTNLTLTINHIPDISPASFHRLDHLVEIDFRCNCVPIPLGSKNNMCIKRIjQIKPRSFS 
GLTYLKSLYLDGNQLLEIPQGLPPSLQLLSLEANNIFSIRKENLTELANIEILYLGQNCYYR 
NPCYVSYSIEKDAFLNLTKLKVLSLKDmm?AVPTVL^ 

NQLQ I LDLSGNCPRCYNAPFPCAPCiKNNSPLQI PVNAFDALTELKVLRLHSNSLQHVPPRWF 
KNINKLQELDLSQNFIAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSL 
KI LRI RGYVFKELKS FNLS PLHNLQNLEVLDLGTNFIKI ANLSMFKQFKRIiKVT DLSYNKI S 
PSGDSSEVGFCSNARTSVESYEPQVIjEQLHYFRYDKYARSCRFKNKEASFMSVNESCYKYGQ 
TLDLSKNSIFFVKSSDFQHLSFLKCLNLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLH 
STAFEELHKLEVLDISSNSHYFQSEGITHMLNFTKNLKVLQKLMMNDNDISSSTSRTMESES 
LRTLEFRGNHLDVLWREGDNRYLQLFKNLLKLEELDISKNSLSFLPSGVFDGMPPNLKNLSL 
AKNGLKSFSWKKXQCLKNLETLDLSHNQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQ 
DAFQLRYLDLSSNKIQMIQKTSFPENVLI^LKM^ 

YLATDVTCVGPGAHKGQSVI SLDLYTCELDLTNLI LFSLS ISVSLFLMVMMTASHLYFWDVW 
YIYHFCKAKIKGYQRLISPDCCYDAFIVYDTKDPAVTEWVLAELVAKLEDPREKHFNLCLEE 
RDWLPGQPVLENLSQSIQLSKICrVFVMTDKYAKTENFKIAFYLSHQRLMDEKVDVIILIFLE 
KPFQKSKFLQLRKRLCGSSVLEWPTNPQAHPYFWQCLKNALATDNHVAYSQVFKETV 

Signal sequence: 

amino acids 1-26 

Transmembrane domain: 
amino acids 840-860 



FIGURE 210 



GGGTACC^TTCTGCGCTGCTGC^GTTACGG^TGAAAARTTAGAAC^C^GAAA CATGG AftAACaTGTTCCTTC 

AGTCGTCAATGCTGACCTGCATTTTCCTGCTAATATCTGGTTCCTGTGAGTTATGCGCCGAAGAAAATTTTTCTA 

GAAGCTATCCTTGTGATGAGAAAAAGCAAAATGACTCAGTTATT^ 

TTCCCCAAACGGTGGGCAAATATGTGACAGAACTAGACCTGTC^ 

CATTTCAAGGGCTGCAAAATCTCACTAAAATAAATCTAAACCACAACC 

CCGGTATACAATCAAATGGCTTGAATATCACAGACGGGGCATTCCTCAACCTAAAAAACCTAAGGGAGTTACTGC 
TTGAAGACAACCAGTTACCCCAAATACCCTCTGGTTTGCCAGAGTCTTTGAC^ 

ATATATACSkACATAACTAAAGAGGGCATTTCAAGACTTATAAACTTGAAAAATCTCTATTTGGCCTGGAACTGCT 

ATTTTAACAAAGTTTGCGAGAAAACTAACATAGAAGATGGAGTATTTGAAACGCTGACAAATTTGGAGTTGCTAT 

CACTATCTTTCAATTCTCTTTCACACGTGCCACCCAAACTGCCAAGCTCCCTACGCAAACTTTTTCTGAGCAACA 

CCCAGATCAAATACATTAGTGAAGAAGATTTCAAGGGATTGATAAATTTAACATTACTAGATTTAAGCGGGAACT 

GTCCGAGGTGCTTCAATGCCCCATTTCCATGCGTGCCTTGTGATGGTGGTGCTTCAATTAATATAGATCGTTTTG 

CTTTTCAAAACTTGACCCAACTTCGATACCTAAACCTCTCTAGCACTTCCCTCAGGAAGATTAATGCTGCCTGGT 

TTAAAAATATGCCTCATCTGAAGGTGCTGGATCTTGAATTCAACTATTTAGTGGGAGAAATAGTCTCTGGGGCAT 

TTTTAACGATGCTGCCCCGCTTAGAAATACTTGACTTGTCTTTTAACTATATAAAGGGGAGTTATCCACAGCATA 

TTAATATTTCCAGAAACTTCTCTAAACTTTTGTCTCTACGGGCATTGCATTTAAGAGGTTATGTGTTCCAGGAAC 

TCAGAGAAGATGATTTCCAGCCCCTGATGCAGCTTCCAAACTTATC 

AGCAAATCGATTTCAAACTTTTCCAAAATTTCTCCAATCTGG^ 

CGTTGGTAAAAGATACCCGGC^GAGTTATGCAAATAGTTCCTCTTTTCAACGTCATATCCGGAAACGACGCTCAA 
CAGATTTTGAGTTTGACCCACATTCGAACTTTTATCATTTCACCCGTCCTTTAATAAAGCCACAATGTGCTGCTT 
ATGGAAAAGCCTTAGATTTAAGCCTCAACAGTATTTTCTTCATTGGGC 

TTGCCTGTTTAAATCTGTCTGCAAATAGCAATGCTCAAGTGTTAAGTGGAACTGAATTTTCAGCCATTCCTCATG 

TCAAATATTTGGATTTGACAAACAATAGACTAGACTTTGATAATGCTAGTGCTCTTACTGAATTGTCCGACTTGG 

AAGTTCTAGATCTCAGCTATAATTCACACTATTTCAGAATAGCIAGGCGTAACACATCATCTAGAATTTATTCAAA 

ATTTC^CAAATCTAAAAGTTTTAAACTTGAGCCACAACAACATTTATACTTTAACAGATAAGTATAACCTGGAAA 

GCAAGTCCCTGGTAGAATTAGTTTTC^GTGGCAATCGCCTTGACATTTTGTGGAATGATGATGACAACAGGTATA 

TCTCCATTTTCAAAGGTCTCAAGAATCTGACACGTCTGGATTTATCCCTTAATAGGCTGAAGCACATCCCAAATG 

AAGCATTCCTTAATTTGCCAGCGAGTCTCACTGAACTAC^TATAAATGATAATATGTTAAAGTTTTTTAACTGGA 

CATTACTCCAGC^GTTTCCTCGTCTCGAGTTGCTTGACTTACGTGGAAACAAACTACTCTTTTTAACTGATAGCC 

TATCTGACTTTACATCTTCCCTTCGGACACTGCTGCTGAGTC^TAACAGGATTTCCCACCTACCCTCTGGCTTTC 

TTTCTGAAGTCAGTAGTCTGAAGCACCTCGATTTAAGTTCCAATCTGCTAAAAACAATCAACAAATCCGCACTTG 

AAACTAAGACCACCACCAAATTATCTATGTTGGAACTACa^ 

ATTTCCGAAGATGGATGGATGAACATCTGAATGTCAAAATTCCC^^ 

GGGATCAAAGAGGGAAGAGTATTGTGAGTCTGGAGCTAACAACTT 

TTTTCTTCACGTTCTTTATCACCACCATGGTTATGTTGGCTGCCCTGGCTCACCATTTGTTTTACTGGGATGTTT 
GGTTTATATATAATGTGTGTTTAGCTAAGGTAAAAGGCTACAGGTCTCTTTCCACATCCCAAACTTTCTATGATG 
CTTACATTTCTTATGACACCAAAGATGCCTCTGTTACTGACTGGGTGATAAATGAGCTGCGCTACCACCTTGAAG 

TCATGCAGAGCATCAACCAAAGCAAGAAAACAGTATTTGTTTTAACCAAAAAATATGCAA 

AAAC^GCTTTTTACTTGGCTTTGCAGAGGCTAATGGATGAGAACATGGATGTGATTATATTTATCCTGCTGGAGC 

CAGTGTTACAGCATTCTCAGTATTTGAGGCTACGGCAGCGGATCTGTAAGAGCTCCATCCTCCAGTGGCCTGACA 

ACCCGAAGGCAGAAGGCTTGTTTTGGCAAACTCTGAGAAATGTGGTCTTGACTGAAAATGATTCACGGTATAACA 

ATATGTATGTCGATTCCATTAAGCAATACI^CTGACG 

GAATGACATTTCTGTATTAGTTATCTATTGCTATO^ 

TTTGCTGGCCCACAGTTTTTGAGGGTCAGGSiGTCCAGGCCCAGCATAACTGGGTCCTCTGCTCAGGGTGTCTCAG 
AGGCTGCAATGTAGGTGTTCACC1AGAGACATAGGCATCACTGGGGTCACACTCATC 

ATTCCTCCTGGGCTATTGGCGAAAGGCTATACTCATGTAAGCCATGCGAGCCTCTCCCACAAGGCAGCTTGCTTC 
ATCAGAGCTAGCAAAAAAGAGAGGTTGCTAGCAAGATGAAGTCACAATCTTTTGTAATCGAATCAAAAAAGTGAT 
lATCTCATCACTTTGGCCATATTCTATTTGTTAGAAGTAAACC^CAGGTCCCACCAGCTCCATGGGAGTGACCACC 
TCAGTCCAGGGAAAACAGCTGAAGACCAAGAJGGTGAGCTCTGATTGCTTCAGTTGGTCATCAACTATTTTCCCT 
TGACTGCTGTCCTGGGATGGCCTGCTATCTTGATGATAGATTGTGAATATCAGGAGGCAGGGATCACTGTGGACC 
ATCTTAG<^GTTGACCTAACACATCTTCTTTTCAATATCTAAGAACTTTTGCCACTGTGACTAATGGTCCTAATA 
TTAAGCTGTTGTTTATATTTATCATATATCTATGGCTACATGGTTATATTATGCTGTGGTTGCGTTCGGTTTTAT 
TTACAGTTGCTTTTACAAATATTTGCTGTAACATTTGACTTCTAAGGTTTAGATGCCATTTAAGAACTGAGATGG 
ATAGCTTTTAAAGCATCTTTTACTTCTTACCATTTTTTAAAAGTATGCAGCTAAATTCGAAGCTTTTGGTCTATA 
TTGTTAATTGCCATTGCTGTAAATCTTAAAATGAATGAATAAAAATGTTTCATTTTAC^AAAAAAAAAAAAAAA 



FIGURE 211 

MENMPLQSSMLTCIFLLISGSCELCAEENFSRSYPCDEKKQNDSVIAECSNRRLQEVPQTVG 
KYVTELDLSDNFITHITNESFQGLQNLTKINLNHNPNVQHQNGNPGIQSNGLNITDGAFLNL 
KNLRELLLEDNQLPQIPSGLPESLTELSLIQNNIYNITKEGISRLINLKNLYLAWNCYFNKV 
CEKTNIEDGVFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINL 
TLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNM 
PHLKVLDLEFNYLVGEIVSGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRAL 
HLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLV 
KDTRQSYANSSSFQRHIRKRRSTDFEFDPHSNFYHFTRPLIKPQCAAYGKALDLSLNSIFFI 
GPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEV 
LDLSYNSHYFRIAGVTHHLEFIQNF'rNLKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRL 
DILWlTODDNRYISIFKGLKNLTRLDLSLNRLKHIPNEAFLNLPASLTELHINDNMLKFFSnffT 
LLQQFPRLELLDLRGNKLLFLTDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSLKHLDLS 
SNLLKTINKSALETKTTTKLSMLELHGNPFECTCDIGDFRRWMDEHLNVKIPRLVDVICASP 
GDQRGKSIVSLELTTCVSDVTAVILFFFTFFITTMVMLAAI^AHHLFYWDWFIYIWCLAKVK 
GYRSLSTSQTFYDAYISYDTKDASWDWVINELRYHLEESRDKNVLLCLEERDWDPGLAIID 
NLMQS INQSKKTVFVLTKKYAKSWNFKTAFYLALQRLMDENMDVI I FI LLEPVLQHSQYLRL 
RQRICKS S I LQWPDNPKAEGLFWQTIjRNVVLTFJSTDSRYNNMYVDS I KQY 

Signal sequence: 

amino acids 1-26 



Transmembrane domain: 

amino acids 826-848 



FIGURE 212 



CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 
CGACCTCGACCCACGCGTCCGCCA/iGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 
CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 
AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 
TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGA 
GCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAA 
GGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT 
CTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGT 
CCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCA 
CCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGC 
CGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGAC 
CAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGA 
GCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCA 
GATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGCCGG 
CAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGC 
CCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAA 
GAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGC 
CCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCC 
TGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTG 
GAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGT6ACTGCCCAGCGCCCCAGGCTG 
GACTGAGCCCCTCACGCCGCCCTGG^GCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAG 
AAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCC 
TTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCAC 
CCCTGGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTG 
AGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAG 
GCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGA 
AAA^AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

CGACCTGCAGAAGCTTGGCCGC(^TGGCCCAACTTGTTTATTG(^GCTTATAATGGTTAC^AT 



FIGURE 213 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCKNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 



Signal sequence: 

1-19 



FIGURE 214 



GCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAG 
GGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCC 
AGCAGCATCAGAGCAGCCCCTGTGGTTGGCAGCAAAGTTCAGCTTGGCTGGGCCCGCTGTGA 
GGGGCTTCGCGCTACGCCCTGCGGTGTCCCGAGGGCTGAGGTCTCCTCATCTTCTCCCTAGC 
AGTGGATGAGCAACCCAACGGGGGCCCGGGGAGGGGAACTGGCCCCGAGGGAGAGGAACCCC 
AAAGCCACATCTGTAGCCAGGATGAGCAGTGTGAATCCAGGCAGCCCCCAGGACCGGGGAGG 
CACAGGTGGCCCCCACCACCCGGAGG'rAGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTC 
TCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGC 
TCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTA 
CCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCG 
TGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTAC 
CGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTA 
CGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATAT 
GCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCA 
GGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTG 
TCCCCAGCGCTGGA.TCAACACCGCCGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCC 
TGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCG 
ACAGGAGTGGACAGTGCAATGAAGGASlGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCT 
GGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGC 
ATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGAC 
TCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGA 
CTCGTGACTGCCCAGCGCTCCAGGCTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATG 
CCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGC 
AGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGAT 
GGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACCCCCACCCTGGCTACCCCAACGGCA 
TCCCAAGGCCAGGTGGACCCTCAGCTGAGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGAC 
CCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGAC 
CCCCAGCACAATAAAAATGAAACGTG 



FIGURE 215 



MRGSQEVLI^WLLVLAVGGTEHAyRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 



Signal sequence: 
1-19 



FIGURE 216 



CCCACGCGTCCGAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGACAGGCCAGGCA 
GGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAGGG 
TCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCCAGCAGCAT 
CAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGC 
CCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGC 
CTGGAGGCACAGGCC ATGA GGGGCTCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGC 
AGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACG 
GGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGAC 
GGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGG 
GCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTC 
CTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAG 
CCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGA 
ATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCGTCAACACCGCCGGCAGTTACTGGT 
GCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGG 
CCCCCCAGGGTGGCCCCCAACCCGACA(3GAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAG 
GCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACA 
GCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCC 
TTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCT 
GGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGGCTGGACTGAGCCCC 
TCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCG 
GGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAG 
GCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACC 
CCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTGAGGGAAGGTAC 
GAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGG 
CCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTG 



FIGURE 217 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGIiPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCVNTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEliCLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 



Signal sequence: 

1-19 



FI GURE 218 



GGTTGCCACAGCTGGTTTAGGGCCCCGACCACTGGGGCCCCTTGTCAGGAC3GAGACAGCCTCCCGGCCCGGGGA.G 

GAC^^GTCGCTGCCACCTTTGGCTGCCGACGTGATTCCCTGGGACGGTCCGTTTCCTGCCGTCAGCTGCCGGCCG 

AGTTGGGTCTCCGTGTTTCAGGCCGGCTCCCCCTTCCTGGTCTCCCTTCTCCCGCTGGGCCGGTTTATCGGGAGG 

AGATTGTCTTCCAGGGCTAGC^TTGGACTTTTGATGATGOT^ 

TTCAAAGCTGGGCTCAGCCTCTGTTTCTTCTCTCGTC 

TCTCTGTGATGGTGGTGAGAAAGAAGGTGACJlCGGAAATGGGAGAAACTCCCAGGCAGGAACACCTTTTGCTGTG 

ATGGCCGCGTCATGATGGCCCGGCSiAAAGGGC^TTTTCTACCTGACCCTTTTCCTCATCCTGGGGACATGTA 

TCTTCTTCGCCTTTGAGTGCCGCTACCTGGCTCTTCAGCTGTCTCCTGCCATCCCTGTATTTGCTGCCATGCTCT 

TCCTTTTCTCCATGGCTACACTGTTGAGGACC1AGCTTGAGTGACCCTGGAGTGATTCCTCGGGCGCTACCAGATG 

AAGC&GCTTTCATAGAAATGGAGATAGAAGCTACCAATGGTC^ 

TCAAGAATTTCCAGATAAACAACCAGATTGTGAAACT 

GGGCCTCCCATTGCAGCATCTGTGACAACTGTGTGGAGCGCTTCGACCATCACTGCCCCTGGGTGGGGAATTGTG 
TTGGAAAGAGGAACTACCGCTACTTCTACCTCTTCATCCTTTCTCTCTCCCTCCTCACAATCTATGTCTTCGCCT 
TCAACATCGTCTATGTGGCCCTCAAATCTTTGAAAATTGGCTTCTTGGAGAC^TTGAAAGAAACTCCTGGAACTG 
TTCTAGAAGTCCTCATTTGCTTCTTTACaCTCTGGTCCGTCGTGGGACTGACTGGATTTCATACTTTCCTCGTGG 
CTCTCAACCAGACAACCAATGAAGACATCAAAGGATC^ 

ATGGCAATATTGTGAAGAACTGCTGTGAAGTGCTGTGTGGCCCCTTGCCCCCCAGTGTGCTGGATCGAAGGGGTA 
TTTTGCGACTGGAGGAAAGTGGAAGTCGACCTCCCAGTACTCAAGAGACCAGTAGCAGCCTCTTGCCACAGAGCC 
CAGCCCCCACAGAACACCTGAACTCAAATGAGIATC 

AGCCCCCAGAGCCACC^CAGGAGGCAGCTGAAGCTGAGAA GTAG CCTATCTATGGAAGAGACTTTTGTTTGTGTT 
TAATTAGGGCTATGAGAGATTTCAGGTGAGAAGTTAAACCTGAGACAGAGAGCAAGTAAGCTGTCCCTTTTAACT 
GTTTTTCTTTGGTCTTTAGTCACCCAGTTGCACACTGGCATTTTCTTGCTGCAAGCTTTTTTAAATTTCTGAACT 
CAAGGCAGTGGCAGAAGATGTC^GTCACCTCTGATAACTGGAAAAATGGGTCTCTTGGGCCCTGGCACTGGTTCT 
CCATGGCCTCAGCCAC^GGGTCCCCTTGGACCCCCTCTC^^ 

TGGTCTCATTCTGGGGCTAAAAGTTTTTGAGACTGGCTCAAATCCTCCC^GCTGCTGCACGTGCTGAGTCCAGA 

GGCAGTCACAGAGACCTCTGGCCAGGGGATCCXAACTGGGTTCTTGGGGTCTTCAGGACTGAAGAGGAGGGAGAG 

TGGGGTCAGAAGATTCTCCTGGCCACCAAGTGCCAGCA^ 

TCCACTTGTTGTANiraNNNlSnStt^^ 

CAGGAATGGCAGTAATAAAAGTCTGCACTTTGGTCATTTCTTTTCCTCAGAGGAAGCCCGAGTGCTCACTTAAAC 

GCTCTCCTCTCCTCTCCTCTCCCCCGATGTACCCTCaAAAAAAAAAAAATGCTAACCAGTTCTTCCATTAAGCCT 
CGGCTGAGTGAGGGAAAGCCCAGCACTGCTGCCCTCTCGGGTAACTCACCCTAAGGCCTCGGCCCACCTCTGGCT 
ATGGTAACCACACTGGGGGCTTCCTCCAAGCCCCGCTCTT^ 

CACCCTGGGGGTGGGCTGTGGCCCCCAGTC^GCTCTGCTC^GGACOTGCTCTATTTCAGGGAAGAAGATTTATGT 
ATTATATGTGGCTATATTTCCTAGAGCACCTGTGTTTTCCTCTTTCTAAGCCAGGGTCCTGTCTGGATGACTTAT 
GCGGTGGGGGAGTGTAAACCGGAACTTTTCATCTATTTGAAGGCGATTAAACTGTGTCTAATGCA 



FIGURE 219 



MSVMVWK2CVTRKWEKLPGRNTFCCDC3RVMMARQKGI FYLTLFLI LGTCTLFFAFECRYLAV 
QLSPAI PVFAAMLFLFSMATLLRTS FSDPGVI PRALPDEAAF I EME I EATNGAVPQGQRPPP 
RIKNFQINNQIVKLKYCYTCKIFRPPRASHCSICDNCVERFDHHCPWVGNCVGKRNYRYFYL 
FILSLSLLTIYVFAFNIVYVALKSLKIGFLETLKETPGTVLEVLICFFTLWSWGLTGFHTF 
LVALNQTTNEDIKGSWTGKNRVQNPY£3HGNIVKNCCEVLCGPLPPSVLDRRGILPLEESGSR 
PPSTQETSSSLLPQSPAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK 



Putative transmembrane domains: 

amino acids 36-55 (type II TM) , 65-84, 188-208, 229-245 



FIGURE 220 



AAAACCCTGTATTTTTTACAATGCAAATAGACAATNANCCTGGAGGTCTTTGAATTAGGTAT 
TATAGGGATGGTGGGGTTGATTTTTNTTCCTGGAGGCTTTTGGCTTTGGACTCTCNCTTTCT 
CCCACAGAGCNCTTCGACCATCACTGGCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTA 
CCGCTANTTCTACCTCTTCATCCTTTOTCTCTCCCNCCTCACAATCTATGTCTTCGCCTTCA 
ACATCGT 



FIGURE 221 



GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACACAA 
TCTATCAGGAAAGAAAGAAAGAAAAAAACCGAACCTGACAAAAAAGAAGAAAAAGAAGAAGA 
AAAAAAATCATCAAAACC^TCCAGCCJ^AAAATGC^aiATTCTATCTCTTGGGCAATCTTCAC 
GGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCC 
CCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATT 
GACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGA 
CAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCG 
AGATCCAGAACGTGGATGTGTATGACGfAGGGCCCTTACACCTGCTCGGTGCAGACAGACAAC 
CACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTC 
TTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAGAC 
CAGAGCCTACGGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGAC 
GAATACTTGGAAATTCAGGGCATCACCCGGGAGCAGTCAGGGGACTACGAGTGCAGTGCCTC 
CAATGACGTGGCCGCGCCCGTGGTACGGAGAGTAAAGGTCACCGTGAACTATCCACCATACA 
TTTCAGAAGCCAAGGGTACAGGTGTCCCCGTGGGACAAAAGGGGACACTGCAGTGTGAAGCC 
TCAGCAGTCCCCTCAGCAGAATTCCAGTGGTACAAGGATGACAAAAGACTGATTGAAGGAAA 
GAAAGGGGTGAAAGTGGAAAACAGACCTTTCCTCTCAAAACTCATCTTCTTCAATGTCTCTG 
AACATGACTATGGGAACTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGC 
ATCATGCTATTTGGTCCAGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGG 
CTGCGTCTGGCTGCTGCCTCTTCTGGTCTTGCACCTGCTTCTCAAATTT TGAT GTGAGTGCC 
ACTTCCCCACCCGGGAAAGGCTGCCGCCACCACCACCACCAACACAACAGCAATGGCAACAC 
CGACAGCAACCAATCAGATATATACAAATGAAATTAGAAGAAACACAGCCTCATGGGACAGA 
AATTTGAGGGAGGGGAACAAAGAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAATTGAA 
AATTGCCTTGCAGATATTTAGGTACAATGGAGTTTTCTTTTCCCAAACGGGAAGAACACAGC 
ACACCCGGCTTGGACCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAA 
GGGCTCAGCCTCTCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCA 
AATTCAATCAGTCCATAGAGACGAACA(?AATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGG 
GCACTTTGGTAGACTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAA 
AAAAA 



FIGURE 222 



MKT I QPKMHNS I SWAI FTGLAALCLFQGVPVRSGDATFPKAMDNVTVRQGE S ATLRCTIDNR 
VTRVAWLNRSTI LYAGNDKWCLDPRVVLLSNTQTQYS IE IQNVDVYDEGPYTCSVQTDNHPK 
TSRVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYL 
EIQGITREQSGDYECSASNDVAAPVV.RRVKVTYNYPPYISEAKGTGVPVGQKGTLQCEASAV 
PSAEFQWYKDDKRLIEGKKGVKVEJ^PFLSKLIFFNVSEHDYGNYTCVASNKLGHTNASIML 
FGPGAVSEVSNGTSRRAGCVWLLPLL\TjHLLLKF 



Signal peptide: 

amino acids 1-28 



FIGURE 223 



GAAAAAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTC 
ACGGGGCTGGCTGCTCTGTGTCTCTl'CCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTT 
CCCCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTA 
TTGACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAAT 
GACAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCAT 
CGAGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACA 
ACCACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATT 
TCTTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAG 
ACCAGAG 



FIGURE 224 



ATGGCTGGTGACGGCGGGGCCGGGCAGGGGACCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGA 

ATCACCGCCTGGCCCGACTCC^CC&TGA^^ 

■^GGGGACAAGACAGCTGTTAGGCTC^CGCACGCAG 

GCACTGCTTCTGKMCTGCCTTGTGGCCCra^ 

GAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCTGGACCGAGGGGTGAGCCCCTGTGAGGACTTTTAC 
CAGTTCTCCTGTGGGGGCTGGATTCGGAGGASCCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGC 
CTCTGGGACCAAAACCAGGCCATACTGAAGCACCTGCTTGAA 

CAGAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCCAGCCACTGAGA 
GACCTCATTGAGAAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAGGACAACTTTATGGAGGTGTTGAAG 
GCAGTAGCAGGGACCTACAGGGCCACCCCATTCTTCACCGTC 

AATGTTATCCAGGTGGACCAGTCTGGGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAG 
AAAGTGCTCACTGCCTATCTGGATTACATGGAGGAACTGGGGATGCTGCTGGGTGGGCGGCCCACCTCCACGAGG 
GAGCAGATGCAGCAGGTGCTGGAGTTGGAGATACAGCTGGCCAACATCACAGTGCCCCAGGACCAGCGGCGCGAC 
GAGGAGAAGATCTACCACAAGATGAGCATTTCGGAGCTGCAGGCTCTGGCGCCCTCCATGGACTGGCTTGAGTTC 
CTGTCTTTCTTGCTGTCACCATTGGAGTTGAGTGACTCTGAGCCTGTGGTGGTGTATGGGATGGATTATTTGCAG 
CAGGTGTCAGAGCTC^TO^CCGCACGGAACCAAGCATCCTGAACAATTACCTGATCTGGAACCTGGTGCAAAAG 
ACAACCTCAAGCCTGGACCGACGCTTTGAGTCTGCACAAGAGAAGCTGCTGGAGACCCTCTATGGCACTAAGAAG 
TCCTGTGTGCCGAGGTGGCAGACCTGCATCTCCAACACGGATGACGCCCTTGGCTTTGCTTTGGGGTCACTCTTC 
GTGAAGGCCACGTTTGACCGGCAAAGCAAAGAAATTGCAGAGGGGATGATCAGCGAAATCCGGACCGCATTTGAG 
GAGGCCCTGGGACAGCTGGTTTGGATGGATGAGAAGACCCGCCAGGCAGCCAAGGAGAAAGCAGATGCCATCTAT 
GATATGATTGGTTTCCCAGACTTTATCCTGGAGCCCAAAGAGCTGGATGATGTTTATGACGGGTACGAAATTTCT 
GAAGATTCTTTCTTCCAAAACATGTTGAATTTGTACAACTTCTCTGCCAAGGTTATGGCTGACCAGCTCCGCAAG 
CCTCCCAGCCGAGACCAGTGGAGCATGACCCCCCAGACAGTGAATGCCTACTACCTTCCAACTAAGAATGAGATC 
GTCTTCCCCGCTGGC^TCCTGCAGGCCCCCTTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATC 
GGTGTGGTCATGGGCCATGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTG 
CGGCCCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACACGGCCTGCATGGAGGAACAGTACAATCAA 
TACCAGGTCAATGGGGAGAGGCTCT^CGGCCG^^CAGACGCTGGGGGAGAACATTACTGACAACGGGGGGCTGAAG 
GCTGCCTACAATGCTTACAAAGCATGGCTGAGJ^AAGCATGGGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACC 
AACCACCAGCTCTTCTTCGTGGGATTTGCCCA.&GTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACGAGGGG 
CTGGTGACCGACCCCCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGG 
CACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCC^GGGCAGCTGTGTGAGGTGTGGTOGACCTGGATCAGGGGA 
GAAATGGCCAGCTGTCACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGTTTGCTCTTGGGTTGGGAGGAAGCAA 
^TGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCGACAGGTGAf^ 

TGCCTCTGCTTTGGGGGTGCCCCTGCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCT 
GCCTGGAAGAGGTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCC 



FIGURE 225 



MNVALQE LGAGSNVGFQKGTRQLLGSRTQLELVLAGAS LLLAALLLGCLVALGVQYHRDPSH 
STCLTEACIRVAGKILESLDRGVSPCEDFYQFSCGGWIRRNPLPDGRSRWNTFNSLWDQNQA 
ILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQPLRDLIEKIGGWNITGPWDQDN 
FMEVLKAVAGTYRATPFFTVYI SADSKS SNSNVI QVDQSGLFLPSRDYYLNRTANEKVLTAY 
LDYMEELGMLLGGRPTSTREQMQQVLELEIQLANITVPQDQRRDEEKIYHKMSISELQALAP 
SMDWLE FLS FLLS PLE L SDS E PVWYGMD YLQQVS EL INRTE PS I LNNYL IWNLVQKTTS S L 
DRRFESAQEKLLETLYGTKKSCVPRWQTCISNTDDALGFALGSLFVKATFDRQSKEIAEGMI 
S E I RTAFEE ALGQLVWMDEKTRQAAKEKADAI YDMI GFPDF I LE PKELDDVYDGYE I SEDS F 
FQMyELNLYNFSAKVMADQLRKPPSRDQWSMTPQTVNAYYLPTKNEIVFPAGILQAPFYARNH 
PKALNFGGIGWMGHELTHAFDDQGREYDKEGNLRPWWQNESLAAFRNHTACMEEQYNQYQV 
NGERLNGRQTLGENITDNGGLKAAYNAYKAWLRKHGEEQQLPAVGLTNHQLFFVGFAQVWCS 
VRTPESSHEGLVTDPHSPARFRVLGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW 



Type II Transmembrane domain: 

amino acids 32-57 



GCCCGGCCCTCCGCCCTCCGCACTCCCGCCTCCCTCCCTCCGCCCGCTCCCGCGCCCTCCTCCCTCCCTCCTCCC 
CAGCTGTCCCGTTCGCGTCATGCCGAGCCTCCCGGCCCCGCCGGCCCCGCTGCTGCTCCTCGGGCTGCTGCTGCT 
CGGCTCCCGGCCGGCCCGCGGCGCCGGCCCAGtAGCCCCCCGTGCTGCCCATCCGTTCTGAGAAGGAGCCGCTGCC 
CGTTCGGGGAGCGGCAGGTAGGTGGGCGCCCGGGGGAGGCGCGGGCGGGGAGTCGGGCTCGGGGCGAGTCAGCGC 
CAGCCCGGAGGGGGCGCGGGGCGCAGGTGGCTCGGCGCGGCGGGCGGCCCGGAGGGTGGGCGGGGGCAGAAGGGC 
GCGGTGCCTGGGACCCGGGACCCGCGGGC^GCCCCCGGGGCGGC^CACGGCGCGAGCTGGGCAGCGGCCTCCAGC 
CAAGCCCGTCCCCGCAGGCTGCACCTTCGGCGGGAAGGTCTATGCCTTGGACGAGACGTGGCACCCGGACCTAGG 
GGAGCCATTCGGGGTGATGCGCTGCGTGCTGTGCGCCTGCGAGGCGCAGTGGGGTCGCCGTACCAGGGGCCCTGG 
CAGGGTCAGCTGCAAGAACATCAAACCAGAGT'GCCCAACCCCGGCCTGTGGGCAGCCGCGCCAGCTGCCGGGACA 
CTGCTGCCAGACCTGCCCCCAGGACTTCGTGGCGCTGCTGACAGGGCCGAGGTCGCAGGCGGTGGCACGAGCCCG 
AGTCTCGCTGCTGCGCTCTAGCCTCCGCTTCTCTATCTCCTACAGGCGGCTGGACCGCCCTACCAGGATCCGCTT 
CTCAGACTCCAATGGCAGTGTCCTGTTTGAGCACCCTGCAGCCCCCACCCAAGATGGCCTGGTCTGTGGGGTGTG 
GCGGGCAGTGCCTCGGTTGTCTCTGCGGCTCCTTAGGGCAGAACAGCTGCATGTGGCACTTGTGACACTCACTCA 
CCCTTCAGGGGAGGTCTGGGGGCCTCTCATCCGGCACCGGGCCCTGTCCCCAGAGACCTTCAGTGCCATCCTGAC 
TCTAGAAGGCCCCCACCAGCAGGGCGTAGGGGGCATCACCCTGCTCACTCTCAGTGACACAGAGGACTCCTTGCA 
TTTTTTGCTGCTCTTCCGAGGCCTTGCAGGACTAACCCAGGTTCCCTTGAGGCTCCAGATTCTACACCAGGGGCA 
GCTACTGCGAGAACTTCAGGCCAATGTCTCAGCCCAGGAACCAGGCTTTGCTGAGGTGCTGCCCAACCTGACAGT 
CCAGGAGATGGACTGGCTGGTGCTGGGGGAGCTGCAGATGGCCCTGGAGTGGGCAGGCAGGCCAGGGCTGCGCAT 
CAGTGGAGACATTGCTGCCAGKSAAGAGOTGCGACGTCCTGC?^AAGTGTCCTTTGTGGGGCTAATGCCCTGATCCC 
AGTCCAAACGGGTGCTGCCGGCTCAGCCAGCCTCACT 

GGTAGGGACAACCAGTGAGGTGGTGGCCATGACACTGGAAACCAAGCCTCAGCGGAGGGATCAGCCCACTGTCCT 

GTGCCACATGGCTGGCCTATCCTCCCCTGCCCCCAGGCCGTGGGTATCTGCCCTGGGCTGGGGTGCCCGAGGGGC 

TCATATGCTGCTGCAGAATGAGCTCTTCCTGAACGTGGGCACCAAGGACTTCCCAGACGGAGAGCTTCGGGGGCA 

ACGTGGCTGCCCTGCCCTACTGTGGGGCATAGCGCCCGCCCTGCCCGTGCCCCTAGCAGGAGCCCTGGTGCTACC 

CCCTGTGAAGAGCC^GC^GCAGGGCACGCCTGGCTTTCCTTGGATACCCACTGTCACCTGCACTATGAAGTGCT 

GCTGGCTGGGCTTGGTGGCTCAGAACAAGGCACTGTCACTGCCCACCTCCTTGGGCCTCCTGGAACGCCAGGGCC 

TCGGCGGCTGCTGAAGGGATTCTATGGCTCAGAGGCCCAGGGTGTGGTGAAGGACCTGGAGCCGGAACTGCTGCG 

GCACCTGGCAAAAGGCATGGCTTCCCTGATGATCACCACCAAGGTAGCCCCAGAGGGGAGCTCCGAGGGCTVGCC^ 

CTCCTCCCAGGTGCACATAGCCAACCAATGTGAGGTTGGCGGACTGCGCCTGGAGGCGGCCGGGGCCGAGGGGGT 

GCGGGCGCTGGGGGCTCCGGATACAGCCTCTGCTGCGCCGCCTGTGGTGCCTGGTCTCCCGGCCCTAGCGCCCGC 

CAAACCTGGTGGTCCTGGGCGGCCCCGAGACCCCAACACATGCTTCTTCGAGGGGCAGCAGCGCCCCCACGGGGC 

TCGCTGGGCGCCC^^CTACGACCCGCTCTGCTtZACTCTGCACCTGCCAGAGACGAACGGTGATCTGTGACCCGGT 

GGTGTGCCC^CCGCCCAGCTGCCCACACCCGGTGCAGGCTCCCGACCAGTGCTGCCCTGTTTGCCCTGGCTGCTA 

TTTTGATGGTGACCGGAGCTGGCGGGCAGCGGGTACGCGGTGGCACCCCGTTGTGCCCCCCTTTGGCTTAATTAA 

GTGTGCTGTCTGCACCTGCAAGCAGGGGGGCACTGGAGAGGTGCACTGTGAGAAGGTGCAGTGTCCCCGGCTGGC 

CTGTGCCCAGCCTGTGCGTGTCAACCCCACCGACTGCTGCAAACAGTGTCCAGGTGAGGCCCACCCCCAGCTGGG 

GGACCCCATGCAGGCTGATGGGCCCCGGGGCTGCCGTTTTGCTGGGCAGTGGTTCCCAGAGAGTCAGAGCTGGCA 

CCCCTCAGTGCCCCCGTTTGGAGAGATGAGCTGTATCACCTGCAGATGTGGGGTAAGTGGGGAGCAGAGGCTTGT 

GTGAGGTGGGTACTGGGAGCCTGGTCTGGAGmGGGAGACCTTCCCAGGGAGGTCCCTGAAGAAGCTGAAGGTCA 

CTGTGTCCCAGTGCCTCTGGGGGACACTCAGTGTCTGCTCTGTCTTGTACCAGGCAGGGGTGCCTCACTGTGAGC 

GGGATGACTGTTCACTGCCACTGTCCTGTGGCTCGGGGAAGGAGAGTCGATGCTGTTCCCGCTGCACGGCCCACC 

GGCGGCGTAAGTGAGGGAGTCCAGGGTCAGCA(3CTGTGAGTGGAGGGCTCACCTGCCTGTGGGACTCCTGATCAG 

GGAAGGGAGCACTCA.CTGTGTGCAGGAACAGTGCAGCCTGCCTCACAAGTGCCATTCCAATCCACCCTCA 

ACCTGGTGGAATTGTTATTTATGACCTTTTCTTTAC^WVTGAGATTTCTGAAGCTCAGAGAAATTAAGCAACGAG 

ATGAAGGTCACCCAGCTGTGTGCACTGACCTGTTTAGAAAATACTGGCCTTTCTGGGACCAAGGCAGGGATGCTT 

TGCCCTGCCCTCTATGCCTCTCTGTGCCTCTCCACTCCCTCTCCCCTCCTCCAACATTCCCTCCCTTCTGTCTCC 

AGCAGCCCCAGAGACCAGAACTGATCCAGAGCTGGAGAAAGAAGCCGAAGGCTCTTAGGGAGCAGCCAGAGGGCC 

AAGTGACCAAGAGGATGGGGCCTGAGCTGGGGAAGGGGTGGCATCGAGGACCTTCTTGCATTCTCCTGTGGGAAG 

CCCAGTGCCTTTGCTCCTCTGTCCTGCCTCTACTCCCACCCCCACTACCTCTGGGAACCACAGCTCCACAAGGGG 

GAGAGGCAGCTGGGCCAGACCGAGGTCACAGCCACTCCAAGTCCTGCCCTGCCACCCTCGGCCTCTGTCCTGGAA 

GCCCCACCCCTTTCTTCCTGTACATAATGTCACTGGCTTGTTGGGATTTTTAATTTATCTTCACTCAGCACCAAG 

GGCCCCGGACACTCCACTCCTGCTGCCCCTGAGCTGAGCAGAGTCATTATTGGAGAGTTTTGTATTTATTAAAAC 

ATTTCTTTTTCAGTCTTTGGGCATGAGGTTGGCTCTTTGTGGCCAGGAACCTGAGTGGGGCCTGGTGGAGAAGGG 

GCNGAGAGTAGGAGGTGAGAGAGAGGAGCTCTGACACTTGGGGAGCTGAAAGAGACCTGGAGAGGCAGAGGATAG 

CGTGGCNNTTGGCTGGCATNCCTGGGTTCCGCTiGAGGGGCTGGGGATGGTTCTTGAGATGGTCTAGAGACTCAAG 

AATTTAGGGAAGTAGAAGC^GGATTTTGACTCAAGTTTAGTTTCCC^CATCGCTGGCCTGTTTGCTGACTTC^ 

TTTGAAGTTGCTCCAGAGAGAGAATCAAAGGTGTCACC^ 

TTCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCG 



FIGURE 227 

GGCCGAGCGGGGGTGCTGCGCGGCGGCCGTGATGGCTGGTGACGGCGGGGCCGGGCAGGGGA 
CCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGAATCACCGCCTGGCCCGAC 
TCCACCATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAACGTGGGATTCCAGAAGGG 
GACAAGACAGCTGTTAGGCTCACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTAC 
TGCTGGCTGCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTC C AGTAC CACAGAGACCCA 
TCCCACAGCACCTGCCTTACAGAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCT 
GGACCGAGGGGTGAGCCCCTGTGAGGACTTTTACCAGTTCTCCTGTGGGGGCTGGATTCGGA 
GGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAAC 
CAGGCCATACTGAAGCACCTGCTTGAAAACACCACCTTCAACTCCAGCAGTGAAGCTGAGCA 
GAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCC 
AGCCACTGAGAGACCTCATTGAGAAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAG 
GACAACTTTATGGAGGTGTTGAAGGCAGTAGCAGGGACCTACAGGGCCACCCCATTCTTCAC 
CGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGGACCAGTCTG 
GGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAGAAAGTAAGGAAC 
ATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGCTGATCCCTGTTGACTTTTCCCT 
TTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTGTCACCTAGTGAACAAACTGCCCCT 
CCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTTTCTTCCCCTTTTCCTTCCTTCCTTCC 
TCTTATTCTTCTAGTAGGTTTCATAGACACCTACTGTGTGCCAGGTCCAGTGGGGGAATTCG 
GAGATATAAGTTTCCGAGCCATTGCCACAGGAAGCGTTCAGTGTCGATGGGTTCATGGACCT 
AGATAGGCTGATAACAAAGCTCACAAGrAGGGTCCTGAGGATTCAGGAGAGACTTATGGAGCC 
AGCAAAGTCTTCCTGAAGAGATTGCATTTGAGCCAGGTCCTGTAG 



FIGURE 228 



ATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGCTGGCATCCTGCAGGCCCCC 
TTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATCGGTGTGGTCATGGGCCA 
TGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTGCGGC 
CCTGGTGGCAGAATGAGTCCCTGGCilGCCTTCCGGAACCACACGGCCTGCATGGAGGAACAG 
TACAATCAATACCAGGTCAATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGAACAT 
TGCTGACAACGGGGGGCTGAAGGCTGCCTACAATGCTTACAAAGCATGGCTGAGAAAGCATG 
GGGAGGAGCAGCAACTGCCAGCCGTC5GGGCTCACCAACCACCAGCTCTTCTTCGTGGGATTT 
GCCCAGGTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACGAGGGGCTGGTGACCGACCC 
CCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGGC 
ACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACC 
TGGATCAGGGGAGAAATGGCCAGCTGTCACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGT 
TTGCTCTTGGGTTGGGAGGAAGCAAPiTGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACA 
GGTGACATGAGTACAGACCCTCCTCAATCACCACATTGTGCCTCTGCTTTGGGGGTGCCCCT 
GCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCTGCCTGGAAGAG 
GTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCCTCTTCTGTCCCCAGGCTCACT 
CAGCCTGGCGGCCATGGGGCCTGCCGTGCCTGCCCCACTGTGACCCACAGGCCTGGGTGGTG 
TACCTCCTGGACTTCTCCCCAGGCTCACTCAGTGCGCACTTAGGGGTGGACTCAGCTCTGTC 
TGGCTCACCCTCACGGGCTACCCCCACCTCACCCTGTGCTCCTTGTGCCACTGCTCCCAGTG 
CTGCTGCTGACCTTCACTGACAGCTCCTAGTGGAAGCCCAAGGGCCTCTGAAAGCCTCCTGC 
TGCCCACTGTTTCCCTGGGCTGAGAGGGGAAGTGCATATGTGTAGCGGGTACTGGTTCCTGT 
GTCTTAGGGCACAAGCCTTAGCAAATGATTGATTCTCCCTGGACAAAGCAGGAAAGCAGATA 
GAGCAGGGAAAAGGAAGAACAGAGTTTATTTTTACAGAAAAGAGGGTGGGAGGGTGTGGTCT 
TGGC CCTTATAGGACC 



FIGURE 229 

CCCACGCGTCCGAGCCGCCCGAGAATTAGACACACTCCGGACGCGGCCAAAAGCAACCGAGA 
GGAGGGGAGGCAAAAACACCGAAAAi\CAAAAAGAGAGAAACAACACCCAACAACTGGGGTGG 
GGGGAAGAAAGAAAGAAAAGAAACCCACCCACCCACCAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAATCCTGTGGCGCGCCGCCTGGTTCCCGGGAAGACTCGCCAGCACCAGGGGG 
TGGGGGAGTGCGAGCTGAAAGCTGCTGGAGAGTGAGCAGCCCTAGCAGGGATGGACATGATG 
CTGTTGGTGCAGGGTGCTTGTTGCTCGAACCAGTGGCTGGCGGCGGTGCTCCTCAGCCTGTG 
CTGCCTGCTACCCTCCTGCCTCCCGGCTGGACAGAGTGTGGACTTCCCCTGGGCGGCCGTGG 
ACAACATGATGGTCAGAAAAGGGGACIACGGCGGTGCTTAGGTGTTATTTGGAAGATGGAGCT 
TCAAAGGGTGCCTGGCTGAACCGGTCAAGTATTATTTTTGCGGGAGGTGATAAGTGGTCAGT 
GGATCCTCGAGTTTCAATTTCAACATTGAATAAAAGGGACTACAGCCTCCAGATACAGAATG 
TAGATGTGACAGATGATGGCCCATACJACGTGTTCTGTTCAGACTCAACATACACCCAGAACA 
ATGCAGGTGCATCTAACTGTGCAAGTTCCTCCTAAGATATATGACATCTCAAATGATATGAC 
CGTCAATGAAGGAACCAACGTCACTC'TTACTTGTTTGGCCACTGGGAAACCAGAGCCTTCCA 
TTTCTTGGCGACACATCTCCCCATCAGCAAAACCATTTGAAAATGGACAATATTTGGACATT 
TATGGAATTACAAGGGACCAGGCTGGGGAATATGAATGCAGTGCGGAAAATGCTGTGTCATT 
CCCAGATGTGAGGAAAGTAAAAGTTGTTGTCAACTTTGCTCCTACTATTCAGGAAATTAAAT 
CTGGCACCGTGACCCCCGGACGCAGTGGCCTGATAAGATGTGAAGGTGCAGGTGTGCCGCCT 
CCAGCCTTTGAATGGTACAAAGGAGAGAAGAAGCTCTTCAATGGCCAACAAGGAATTATTAT 
TCAAAATTTTAGCACAAGATCCATTCTCACTGTTACCAACGTGACACAGGAGCACTTCGGCA 
ATTATACCTGTGTGGCTGCCAACAAGCTAGGCACAACCAATGCGAGCCTGCCTCTTAACCCT 
CCAAGTACAGCCCAGTATGGAATTAC^CGGGAGCGCTGATGTTCTTTTCTCCTGCTGGTACCT 
TGTGTTGACACTGTCCTCTTTCACCAGCATATTCTACCTGAAGAATGCCATTCTACA ATAAA 
TTCAAAGACCCATAAAAGGCTTTTAAGGATTCTCTGAAAGTGCTGATGGCTGGATCCAATCT 
GGTACAGTTTGTTAAAAGCAGCGTGGGATATAATCAGCAGTGCTTACATGGGGATGATCGCC 
TTCTGTAGAATTGCTCATTATGTAAATACTTTAATTCTACTCTTTTTTGATTAGCTACATTA 
CCTTGTGAAGCAGTACACATTGTCCITTTTTTAAGACGTGAAAGCTCTGAAATTACTTTTAG 

aggatattaattgtgatttcatgtttgtaatctacaacttttcaaaagcattcagtcatggt 
ctgctaggttgcaggctgtagtttacaaaaacgaatattgcagtgaatatgtgattctttaa 
ggctgcaatacaagcattcagttccc:tgtttcaataagagtcaatccacatttacaaagatg 

CATTTTTTTCTTTTTTGATAAAAAAGCAAATAATATTGCCTTCAGATTATTTCTTCAAAATA 
TAACACATATCTAGATTTTTCTGCTTGCATGATATTCAGGTTTCAGGAATGAGCCTTGTAAT 
ATAACTGGCTGTGCAGCTCTGCTTCI'CTTTCCTGTAAGTTCAGCATGGGTGTGCCTTCATAC 
AATAATATTTTTCTCTTTGTCTCCAA.CTAATATAAAATGTTTTGCTAAATCTTACAATTTGA 
AAGTAAAAATAAACCAGAGTGATCAAGTTAAACCATACACTATCTCTAAGTAACGAAGGAGC 
TATTGGACTGTAAAAATCTCTTCCTGCACTGACAATGGGGTTTGAGAATTTTGCCCCACACT 
AACTCAGTTCTTGTGATGAGAGACAa.TTTAATAACAGTATAGTAAATATACCATATGATTTC 
TTTAGTTGTAGCTAAATGTTAGATCCACCGTGGGAAATCATTCCCTTTAAAATGACAGCACA 
GTCCACTCAAAGGATTGCCTAGCAATACAGCATCTTTTCCTTTCACTAGTCCAAGCCAAAAA 
TTTTAAGATGATTTGTCAGAAAGGGCACAAAGTCCTATCACCTAATATTACAAGAGTTGGTA 
AGCGCTCATCATTAATTTTATTTTGTGGCAGGTATTATGACAGTCGACCTGGAGGGTATGGA 
TATGGATATGGACGTTCCAGAGACTATAATGGCAGAAACCAGGGTGGTTATGACCGCTACTC 
AGGAGGAAATTACAGAGACAATTATGACAACTGAAATGAGACATGCACATAATATAGATACA 
CAAGGAATAATTTCTGATCCAGGATCGTCCTTCCAAATGGCTGTATTTATAAAGGTTTTTGG 
AGCTGCACTGAAGCATCTTATTTTATAGTATATCAACCTTTTGTTTTTAAATTGACCTGCCA 
AGGTAGCTGAAGACCTTTTAGACAGTTCCATCTTTTTTTTTAAATTTTTTCTGCCTATTTAA 
AGACAAATTATGGGACGTTTGTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 230 



I^LLVQGACCSNQWLAAVLLSLCCLLPSCLPAGQSVDFPWAAVDNMMVRKGDTAVLRCYLED 
GASKGAWLNRSSIIFAGGDKWSVDPRVSISTLNKRDYSLQIQNVDVTDDGPYTCSVQTQHTP 
RTMQVHLTVQVPPKI YD I SNDMTVNEGTNVTLTCLATGKPEPS I S WRHI SPSAKPFENGQYL 
D I YGI TRDQAGE YECS AENAVS FPDVRKVKVWNFAPTI QEI KSGTVTPGRSGL IRCEGAGV 
PPPAFEWYKGEKKLFNGQQGIIIQNFSTRSILTVTNVTQEHFGJ^TCVAANKLGTTNASLPL 
NPPSTAQYGITGSADVLFSCWYLVLTLSSFTSIFYLKNAILQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 326-345 

N-glycosylation sites. 

amino acids 71-75, 153-157, 273-277, 284-288, 292-296, 305-309 

Casein kinase II phosphorylation site. 

amino acids 147-151, 208-212, 224-228 

Tyrosine kinase phosphorylation site. 

amino acids 178-186 

N-myristoylation sites. 

amino acids 7-13, 63-70, 67-73, 151-157, 239-245, 291-297, 
302-308, 319-325 



Myelin P0 protein: 

amino acids 92-121 
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AGTGGTTCGATGGGAAGGATCTTTCTCCAAGTGGTTCCTCTTGAGGGGAGCATTTCTGCTGG 
CTCCAGGACTTTGGCCATCTATAAAGCTTGGCAATGAGAAATAAGAAAATTCTCAAGGAGGA 
CGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAA 
TCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATC 
TACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGC 
GCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGT 
CCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACACCTGGCTCAGGGTGCATCGAGGCTG 
CAAGTCCTGCAGGCCCAACTCACCTGGGTCCGCGTCAGCCATGAGCACTTGCTGCAGCGGGT 
AGACAACTTCACTCAGAACCCAGGGATGTTCAGAATCAAAGGTGAACAAGGCGCCCCAGGTC 
TTCAAGGTCACAAGGGGGCCATGGGCATGCCTGGTGCCCCTGGCCCGCCGGGACCACCTGCT 
GAGAAGGGAGCCAAGGGGGCTATGGGACGAGATGGAGCAACAGGCCCCTCGGGACCCCAAGG 
CCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCCAGGGTGCTCCAGGGAAGCAAG 
GAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCAGCAAAGGCGATGGGGGTCTCATT 
GGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAGGAGACCTGGGTCTCCCAGGAAGCAA 
AGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCATGGGGCCTCCTGGAGCCCAGGGGAGTA 
AAGGTGACTTCGGGAGGCCAGGCCCACCAGGTTTGGCTGGTTTTCCTGGAGCTAAAGGAGAT 
CAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGTGCAGTGGGACACCCAGGTGC 
CAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCAGGACTTCCAGGGAGCCCCGGGA 
GTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGACACAGGACTTCAAGGACAGCAAGGA 
AGAAAAGGAGAATCAGGAGTTCCAGGCCCTGCAGGTGTGAAGGGAGAACAGGGGAGCCCAGG 
GCTGGCAGGTCCCAAGGGAGCCCCTGGACAAGCTGGCCAGAAGGGAGACCAGGGAGTGAAAG 
GATCTTCTGGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGAAAGAGGTGAAAACTCAGTGTCC 
GTCAGGATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGG 
GACAATTTGCGATGACGAGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGTT 
ACTCGAAAGGAAGGGCCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAAT 
GTTCAGTGTCGGGGCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCATCA 
TGACTGCAGCCACGAGGAGGACGCAGGCGTGGAGTGCAGCGTCTGACCCGGAAACCCTTTCA 
CTTCTCTGCTCCCGAGGTGTCCTCGGGCTCATATGTGGGAAGGCAGAGGATCTCTGAGGAGT 
TCCCTGGGGACAACTGAGCAGCCTCTGGAGAGGGGCCATTAATAAAGCTCAACATCATTGA 



FIGURE 232 

></usr/seqdb2/sst/DNA/Dnaseqs.full/ss.DNA68886 
xsubunit 1 of 1, 520 aa, 1 stop 
><MW: 52658, pi: 9.16, NX(S/T): 3 

MRNKKILKEDELLSETQQAAFHQI^IEPFEINVPKPKRRNGVNFSLAVWIYLILLTAGAGL 
LWQVLNLQARLRVLEMYFLNDTLAJ\.EDSPSFSLLQSAHPGEHLAQGASRLQVLQAQLTWVR 
VSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRD 
GATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGE 
KGDLGLPGSKGDRGMKGDAGYMGPPCiAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPG 
PPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPA 
GVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGR 
AEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLW 
S CTKNSWGHHDCSHEEDAGVECSV 

Transmembrane domain: 

amino acids 47-66 (type II) 

N-glycosylation sites. 

amino acids 43-47, 83-87, 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 432-440 

N-myristoylation sites. 

amino acids 41-47, 178-184, 253-259, 274-280, 340-346, 346-352, 
400-406, 441-447, 475-481, 490-496, 515-521 

Amidation site. 

amino acids 360-364 

Leucine zipper pattern. 

amino acids 56-78 

Speract receptor repeat 

amino acids 422-471, 488-519 

Clq domain proteins. 

amino acids 151-184, 301-334, 316-349 
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CCCACGCGTCCGAAGGCAGACAAAGGTTCATTTGTAAAGAAGCTCCTTCCAGCACCTCCTCT 
CTTCTCCTTTTGCCCAAACTCACCCAGTGAGTGTGAGCATTTAAGAAGCATCCTCTGCCAAG 
ACCAAAAGGAAAGAAGAAAAAGGGCCAAAAGCCAAAATGAAACTGATGGTACTTGTTTTCAC 
CATTGGGCTAACTTTGCTGCTAGGAGTTCAAGCCATGCCTGCAAATCGCCTCTCTTGCTACA 
GAAAGATACTAAAAGATCACAACTGTCACAACCTTCCGGAAGGAGTAGCTGACCTGACACAG 
ATTGATGTCAATGTCCAGGATCATTTCTGGGATGGGAAGGGATGTGAGATGATCTGTTACTG 
CAACTTCAGCGAATTGCTCTGCTGCGCAAAAGACGTTTTCTTTGGACCAAAGATCTCTTTCG 
TGATTCCTTGCAACAATCAATGAGAA.TCTTCATGTATTCTGGAGAACACCATTCCTGATTTC 
CCACAAACTGCACTACATCAGTATAACTGCATTTCTAGTTTCTATATAGTGCAATAGAGCAT 
AGATTCTATAAATTCTTACTTGTCTAAGACAAGTAAATCTGTGTTAAACAAGTAGTAATAAA 
AGTTAATTCAATCTAAAAAAAAAAAAA 
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< / us r / seqdb2 / s s t /DNA/Dna seqs . min/ s s . DNA5 2 7 5 8 
<subunit 1 of 1, 98 aa, 1 stop 
<MW: 11081, pi: 6.68, NX(S/T) : 1 

MKLMVLVPTIGLTLLLGVQAMPANRLS C YRKI LKDHNCHNLPEGVADLTQ IDVNVQDHFWDG 
KGCEMICYCNFSELLCCPKDVFFGPKISFVIPCNNQ 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 72-76 

Tyrosine kinase phosphoryleition site. 

amino acids 63-71 



FIGURE 235 

CCCACGCGTCCGCGGACGCGTGGGCTGGACCCCAGGTCTGGAGCGAATTCCAGCCTGCAGGG 
CTGATAAGCGAGGCATTAGTGAGATTGAGAGAGACTTTACCCCGCCGTGGTGGTTGGAGGGC 
GCGCAGTAGAGCAGCAGCACAGGCGCGGGTCCCGGGAGGCCGGCTCTGCTCGCGCCGAGATG 
TGGAATCTCCTTCACGAAACCGACTCGGCTGTGGCCACCGCGCGCCGCCCGCGCTGGCTGTG 
CGCTGGGGCGCTGGTGCTGGCGGGTGGCTTCTTTCTCCTCGGCTTCCTCTTCGGGTGGTTTA 
TAAAATCCTCCAATGAAGCTACTAACATTACTCCAAAGCATAATATGAAAGCATTTTTGGAT 
GAATTGAAAGCTGAGAACATCAAGAAGTTCTTACATAATTTTACACAGATACCACATTTAGC 
AGGAACAGAACAAAACTTTCAGCTTGC^AAAGCAAATTCAATCCCAGTGGAAAGAATTTGGCC 
TGGATTCTGTTGAGCTAGCTCATTATGATGTCCTGTTGTCCTACCCAAATAAGACTCATCCC 
AACTACATCTCAATAATTAATGAAGATGGAAATGAGATTTTCAACACATCATTATTTGAACC 
ACCTCCTCCAGGATATGAAAATGTTTCGGATATTGTACCACCTTTCAGTGCTTTCTCTCCTC 
AAGGAATGCCAGAGGGCGATCTAGTGTATGTTAACTATGCACGAACTGAAGACTTCTTTAAA 
TTGGAACGGGACATGAAAATCAATTGCTCTGGGAAAATTGTAATTGCCAGATATGGGAAAGT 
TTTCAGAGGAAATAAGGTTAAAAATGCCCAGCTGGCAGGGGCCAAAGGAGTCATTCTCTACT 
CCGACCCTGCTGACTACTTTGCTCCTGGGGTGAAGTCCTATCCAGACGGTTGGAATCTTCCT 
GGAGGTGGTGTCCAGCGTGGAAATATGCTAAATCTGAATGGTGCAGGAGACCCTCTCACACC 
AGGTTACCCAGCAAATGAATATGCTTATAGGCGTGGAATTGCAGAGGCTGTTGGTCTTCCAA 
GTATTCCTGTTCATCCAATTGGATACTATGATGCACAGAAGCTCCTAGAAAAAATGGGTGGC 
TCAGCACCACCAGATAGCAGCTGGAGAGGAAGTCTCAAAGTGCCCTACAATGTTGGACCTGG 
CTTTACTGGAAACTTTTCTACACAAAAAGTCAAGATGCACATCCACTCTACCAATGAAGTGA 
CGAGAATTTACAATGTGATAGGTACTCTCAGAGGAGCAGTGGAACCAGACAGATATGTCATT 
CTGGGAGGTCACCGGGACTCATGGGTGTTTGGTGGTATTGACCCTCAGAGTGGAGCAGCTGT 
TGTTCATGAAATTGTGAGGAGCTTTGGAACACTGAAAAAGGAAGGGTGGAGACCTAGAAGAA 
CAATTTTGTTTGCAAGCTGGGATGCAGAAGAATTTGGTCTTCTTGGTTCTACTGAGTGGGCA 
GAGGAGAATTCAAGACTCCTTCAAGAGCGTGGCGTGGCTTATATTAATGCTGACTCATCTAT 
AGAAGGAAACTACACTCTGAGAGTTGATTGTACACCGCTGATGTACAGCTTGGTACACAACC 
TAACAAAAGAGCTGAAAAGCCCTGATGAAGGCTTTGAAGGCAAATCTCTTTATGAAAGTTGG 

actaaaaaaagtccttccccagagttc;agtggcatgcccaggataagcaaattgggatctgg 

AAATGATTTTGAGGTGTTCTTCCAACGACTTGGAATTGCTTCAGGCAGAGCACGGTATACTA 

aaaattgggaaacaaacaaattcagcggctatccactgtatcacagtgtctatgaaacatat 
gagttggtggaaaagttttatgatccaatgtttaaatatcacctcactgtggcccaggttcg 

AGGAGGGATGGTGTTTGAGCTAGCCAATTCCATAGTGCTCCCTTTTGATTGTCGAGATTATG 
CTGTAGTTTTAAGAAAGTATGCTGACAAAATCTACAGTATTTCTATGAAACATCCACAGGAA 
ATGAAGACATACAGTGTATCATTTGATTCACTTTTTTCTGCAGTAAAGAATTTTACAGAAAT 
TGCTTCCAAGTTCAGTGAGAGACTCCAGGACTTTGACAAAAGCAACCCAATAGTATTAAGAA 
TGATGAATGATCAACTCATGTTTCTGGAAAGAGCATTTATTGATCCATTAGGGTTACCAGAC 
AGGCCTTTTTATAGGCATGTCATCTATGCTCCAAGCAGCCACAACAAGTATGCAGGGGAGTC 
ATTCCCAGGAATTTATGATGCTCTGTTTGATATTGAAAGCAAAGTGGACCCTTCCAAGGCCT 
GGGGAGAAGTGAAGAGACAGATTTATGJTTGCAGCCTTCACAGTGCAGGCAGCTGCAGAGACT 
TTGAGTGAAGTAGCCTAAGAGGATTT1TTAGAGAATCCGTATTGAATTTGTGTGGTATGTCA 
CTCAGAAAGAATCGTAATGGGTATATTGATAAATTTTAAAATTGGTATATTTGAAATAAAGT 
TGAATATTATATATAA 



FIGURE 236 

></usr/seqdb2/sst/DNA/Dnaseqs . full/ss .DNA52756 
xsubunit 1 of 1, 750 aa, 1 stop 
><MW: 84305, pi: 6.93, NX(S/T): 10 

MWNLLHETDSAVATARRPRWLCAGAL\TjAGGFFLLGFLFGWFIKSSNEATNITPKHNMKAFL 
DELKAENIKKFLHNFTQIPHLAGTEQNFQLAKQIQSQWKEFGLDSVELAHYDVLLSYPNKTH 
PNY I S I INEDGNE I FNTSLFEPPPPGYENVSD I VPPFSAFS PQGMPEGDLVYVNYARTEDFF 
KLERDMKINCSGKIVIARYGKVFRGNKVKNAQLAGAKGVILYSDPADYFAPGVKSYPDGWNL 
PGGGVQRGNI LNLNGAGDPLTPGYPANE YAYRRG I AEAVGLPS I PVHP I GYYD AQKLLE KMG 
GSAPPDSSWRGSLKVPYNVGPGFTGNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYV 
ILGGHRDSWFGGIDPQSGAAVVHEIWSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEW 
AEENSRLLQERGVAYINADSSIEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYES 
WTKKSPSPEFSGMPRISKLGSGNDFEWFQRLGIASGRARYTKNWETNKFSGYPLYHSVYET 
YELVEKF YDPMFKYHLTVAQVRGGMVFELANS I VLPFDCRDYAWLRKYADKI YS I SMKHPQ 
EMKTYSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLP 
DRPFYRHVIYAPSSHNKYAGESFPGIYDALFDIESKVDPSKAWGEVKRQIYVAAFTVQAAAE 
TLSEVA 

Signal sequence: 

amino acids 1-40 

N-glycosylation sites. 

amino acids 76-80, 121-125, 140-144, 153-157, 195-199, 336-340, 
459-463, 476-480, 638-642 

Tyrosine kinase phosphorylation sites. 

amino acids 363-372, 605-613, 606-613, 617-626 

N-myristoylation sites. 

amino acids 85-91, 168-174, 252-258, 256-262, 282-288, 335-341, 
360-366, 427-433, 529-535, 707-713 



